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[57] ABSTRACT 

A system for managing multiple projects of a similar type 
has a global project management database for storing data 
for all participating projects which is dynamically updated 
with best current data representing best current practices 
across all participating projects in the system. Localized 
computer terminals are operated at each local site with a 
common project management program and data imported 
from the global project management database. Periodically, 
the local terminals export data to the global project man- 
agement database which are evaluated to determine any new 
best current practices across all participating projects and to 
update the global project management database with the new 
best current practices. Upon periodically importing data 
from the global project management database, each local- 
ized computer terminal is updated with the new best current 
practices across all participating projects. 

20 Claims, 8 Drawing Sheets 
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SYSTEM FOR MANAGING MULTIPLE 
PROJECTS OF SIMILAR TYPE USING 
DYNAMICALLY UPDATED GLOBAL 
DATABASE 

FIELD OF THE INVENTION 

This invention relates to a system for managing the 
planning and performance of multiple projects of a similar 
type, and particularly to such a system using a dynamically 
updated global database accessed by project participants of 
the system for their respective localized projects. 

BACKGROUND ART 

A project typically requires planning for and performing 
a multitude of individual steps or tasks within a given 
timeline. Such tasks may have associated therewith a 
plethora of reference data, documents, specifications, 
communications, etc. generated and used in a complex 
project. If a vendor of project management services under- 
takes a multitude of projects of a similar type, it is desirable 
to transfer the lessons learned and best practices developed 
in prior projects to new or ongoing projects. It is also 
desirable to systematize project planning and documentation 
across all projects so that services of uniform high quality 
and integrity can be delivered. 

For example, in project management involving nuclear 
power plants, it is desirable for the vendor of project 
management services to carry over reference data, 
documentation, and best practices developed on previous 
projects into new or ongoing projects such as construction of 
a nuclear power plant, or performance of a refueling and 
maintenance outage. Projects of a similar type should use a 
similar planning methodology and task lists for 
performance, refer to the same regulations and specifications 
of applicable government agencies, suppliers, and 
contractors, and employ much of the same forms and other 
documentation. Each project also has individual localized 
characteristics, such as its siting, capacity, customer 
requirements, etc., and often valuable lessons are learned or 
best practices are developed in the course of performing a 
particular project that could be shared with other projects. 

In conventional practice, a vendor typically standardizes 
an approach to project planning and performance and trains 
its project managers and employees accordingly. Once a 
project team is assigned to work on a particular project, 
however, many individual practices and decisions are taken 
on a daily basis, and project activities, data, and documen- 
tation tend to be generated and used locally. This localiza- 
tion of project activities tends to isolate similar projects from 
each other and inhibits the sharing of information and 
standardization of practices across all projects. Promoting 
communication between projects or sharing files and docu- 
ments from a conventional database via a network would not 
completely overcome the problems of localization since 
each project is likely to be performed or documented in 
different ways from each other. 

SUMMARY OF THE INVENTION 

It is therefore a principal object of the invention to provide 
a system for managing multiple projects of a similar type 
which would allow common reference data, documentation, 
best practices, etc. to be used across all projects. It is a 
specific object that such common reference data, 
documentation, best practices, etc. be updated dynamically 
from all projects and used locally at each participating 
project. 



In accordance with the present invention, a system for 
managing multiple projects of a similar type comprises: 

(a) a global project management database for storing data 
for all projects participating in said system, said data- 
base being maintained and updated dynamically with 
best current data representing best current practices 
across all participating projects in said system; 

(b) a plurality of localized computer terminals, each 
having a microprocessor for executing a project man- 
agement program for a participating project associated 
with each said terminal at a respective local site and a 
memory for storing and retrieving data obtained from 
said global project management database for use in the 
project management program at the respective local 

15 site; 

(c) data access means operative between said global 
project management database and each one of the 
localized computer terminals for periodically importing 

20 data from said global project management database, 
including best current data across all projects, for use in 
the participating projects, and for periodically export- 
ing data from the participating projects to said global 
project management database; and 
25 (d) updating means operative in conjunction with said 
global project management database for allowing 
evaluation of said data periodically exported from the 
localized computer terminals of the participating 
projects, determining whether any such data constitutes 
30 new best current practices across all participating 
projects, and updating the best current data of said 
global project management database with the data 
determined to constitute new best current practices 
across all participating projects, said new best current 
35 practices then being imported by the localized com- 
puter terminals from said global project management 
database, 

whereby said global project management database is 
updated dynamically with best current data represent- 
40 ing new best current practices across all participating 
projects of said system, and the localized computer 
terminals import and use the new best current practices 
at the respective participating projects. 
In a preferred embodiment of the invention, the above- 
45 described system maintains best current lists of plans, 
inputs, documents, practices, critical success factors, etc. 
across all projects at the global project management data- 
base. These best lists are loaded in the localized computer 
terminals during periodic data import cycles for use by the 
50 participating projects at their local sites. Reports on lessons 
learned or improved practices developed at any local site are 
exported to the global database for review by a mission 
review group and added to the best lists maintained at the 
global database if determined to represent a best current 
55 practice across all sites. The reports may be exported to the 
global database as a separate communication or included in 
a project data export cycle. In a system employing stand- 
alone or portable computers as localized terminals, data 
import and export may be performed by exchange of data 
60 storage media on a periodic basis. In a system employing 
localized terminals connected to a network, data import and 
export may be performed during log on/off cycles. 

The preferred system employs a uniform directory and file 
structure and a common project management program for 
65 the global project management database and localized com- 
puter terminals. The project management program is used at 
the local sites as a list manager for the best lists obtained 
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from the global database, and its on-site functions include 
file management, revision control, creating links to reference 
data and documents, report preparation and handling, and 
data import/export. The common project management pro- 
gram is operable in a Project Planning Mode and a Project 
Performance Mode. 

As a specific feature of the invention, when the local 
terminals are to import best current data from the global 
database, a revision control procedure is performed. When 
new best lists containing new best current practices are 
obtained from the global database for a local project which 
is in process, the new best current practices are merged into 
the lists of the local project using a Combine function that 
carries over the best current practices already performed at 
the local site and adds the new best current practices which 
remain to be performed. In this manner, new best current 
practices gained from widely dispersed projects are made 
available to a local project on a dynamic basis, and a record 
of the best current practices actually performed at the local 
project is maintained. 

Other objects, features and advantages of the present 
invention are described in further detail below, with refer- 
ence to the following drawings: 

BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 is schematic diagram of the overall architecture for 
a system for managing the planning and performance of 
multiple projects of a similar type in accordance with the 
present invention. 

FIG. 2 illustrates the user interface for the project man- 
agement program. 

FIG. 3 is a schematic diagram of a basic procedure for list 
management in the project management program. 

FIG. 4 is a data flow diagram illustrating a Revision 
Control procedure for updating the project management 
program at local sites. 

FIG. 5 illustrates a pre -formatted utility for the Setup/ 
Update procedure for updating the project management 
program at local sites. 

FIG. 6 illustrates a pre-formatted utility for reporting 
Lessons Learned at a local site. 

FIG. 7 illustrates a pre-formatted utility for reporting 
Feedback from a local site. 

FIG. 8 is a logic diagram for a Combine function for 
merging task lists from two project files. 

DETAILED DESCRIPTION OF A PREFERRED 
IMPLEMENTATION 

The following is a detailed description of a preferred 
implementation of the system of the present invention. In 
this implementation, a refueling and maintenance outage at 
a nuclear power plant is used as an example of a project of 
similar type which is planned and performed at dispersed 
local sites using a dynamically updated global database. It is 
to be understood that the invention is equally applicable to 
a wide range of other types of projects which have the 
characteristic of being of similar type but performed by 
different participants at different locations. 

System Architecture 

Referring to FIG. 1, the overall architecture for a multi- 
project system is illustrated. A global database 1 is main- 
tained by a project management services vendor to store all 
of the data used by the system. The global database 1 is used 
(accessed) by a plurality of localized computer terminals 3 
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each of which is associated with a participating project being 
carried out for a customer at a respective local site. Each 
local terminal is used by participants of the project man- 
agement team assigned to the local site. Each project is 
planned for or contracted with a customer, i.e., an electric 
utility company, responsible for the project to be performed 
at the local site. 

The global database 1 may be maintained, for example, on 
a workstation or server at a central office of the project 
management services vendor. The local terminals 3 may be 
standalone or portable computers or computer terminals 
connected by a network to the central server. The local 
terminals have a microprocessor for executing a common 
project management program used by the system and mass 
memory for storing data obtained from the global database 
1 as well as data generated and/or used locally. Data is 
shared (imported/exported) between the local terminals and 
the central database 1 via a data transfer connection 2 which 
may be a WAN or LAN network, an e-mail or file transfer 
connection, or physical exchange of data storage media. 

A project management program is used to operate the 
global database 1 in common with the local terminals 3. 
Since a common program is used, all local sites send/receive 
and maintain their project information in a common format 
and with complete file compatability with the vendor's 
central server and with all other project sites. This provides 
a central functionality to the system to manage projects of a 
similar type. Information on best current practices and 
lessons learned can be gathered (exported) dynamically at 
the central server from all local projects of a similar type, 
and then shared among all local projects (imported) through 
a local-site program updating procedure to be described in 
further detail below. 

The common project management program is preferably 
written as a customized application of a conventional data- 
base program which will run on servers, PCs, and laptops 
alike, for example, MS Access™ 2.0 for Windows 3.1 or 
DOS 6.0 sold by Microsoft Corp., of Redmond, Wash. The 
program interface allows a user to select program databases 
and display lists which are stored in tables within the 
program databases. The lists can have pointers to project 
files and reference files which are stored in the uniform 
directory and file structure imported from the database. The 
program allows the user to create, add, edit, and delete items 
from the lists, and to generate, import, and export 
documents, reports, and other files. The program encom- 
passes list management functions and database management 
functions. 

At the central server, the project management program is 
used to maintain the global database 1 including the follow- 
ing components: 

(1) Mission Database stores project data exported from 
each of the participating projects of the system, as well 
as best current data obtained and evaluated across all 
projects of the system. Participating projects include 
similar projects already completed and those being 
planned or in the process of being performed, as well 
as projects of other types grouped according to similar 
type. Best current data include lists of plans, tasks, 
documents, practices, critical success factors, etc. 
determined to constitute best current practices across 
all projects. 

(2) Generic Database stores data for the project manage- 
ment services vendor which is usable by any partici- 
pating project, for example, form documents, contracts, 
sales information, and equipment specifications. 
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(3) Poles Database stores customer data and site specific 
data applicable to the participating projects. 

(4) Reference and Other Databases store reference data 
usable by any participating project of the system. Such 
reference data can include, for example, applicable 
codes, standards, and regulations, documents, and com- 
mon lists or processes usable in any project. 

Program Organization 

FIG. 2 illustrates the main screen display of the user 
interface for the example of the project management pro- 
gram as used for construction and maintenance projects of 
similar type at multiple nuclear plant sites maintained by a 
nuclear services vendor (referred to by the applicant as the 
PATH™ program). The buttons in the display represent 
various modules of the global database 1. For this specific 
application, the database modules are broken down into the 
following groupings: 

1. Codes, Standards & Regulations 

2. Customers (customer-specific data) 

3. Poles (site-specific data) 

4. Model: Proposal PATH (project proposals) Mode 2: 
Project PATH (project tracking) 

5. Missions, Products, Projects & Teams (business- 
specific data) 

6. People (correspondence data) 

7. Processes & Tools (commonly used file procedures) 
The main interface includes pull-down menus for func- 
tions that can be performed with respect to the modules of 
the global database. These are described briefly below. 

The File menu provides functionality to exit the PATH 
application, close windows, set program configuration, and 
print and send reports. Program configuration controls how 
often "Critical Success Factors" are displayed on startup, 
and the LAN drive letter used for opening Mission and Pole 
Databases while connected to the LAN. The print and send 
function also includes print preview and print setup func- 
tions. The print functions allow paper printouts. The send 
function provides the ability to send information via MS 
Mail system to other project members. 

The Edit menu provides data editing, checklist item and 
attachment maintenance, and project file linking function- 
ality. Data editing is supported by Undo, Cut, Copy and 
Paste menu items. Checklist item and attachment mainte- 
nance is supported by Add and Delete menu items. Project 
file linking is support by an Add and Delete link function. 

The View menu provides selection for checklist items, 
attachments, and project files. Checklist items can be 
selected on Applicability, Planned or Closed. Attachments, 
which are associated with a Checklist item, can be selected 
on the type of file (reference files, or project specific files), 
or can present all attachments across all project. Project files 
are selected by the phase (three phases of proposal, and the 
single project execution phase) of the project. 

The Sort menu provides optional functions for indexing of 
project files. 

The Project menu provides the functions to select differ- 
ent projects, change project specific information, and main- 
tain projects within PATH. Since PATH stores multiple 
projects, a selection feature allows the changing between 
projects. Project specific information means project title, 
charge numbers, manager names, customer, site, start and 
end dates, order amount and comments. Maintenance of 
projects includes: 

Create New Project. This creates new projects from 
Project Master Templates or existing projects. 



10 



is 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



Combine Projects. This features combines projects 
together to create a new project (explained in detail 
below). 

Delete Project. This deletes a project and related infor- 
mation. 

Import Projects. This imports project data files, created 
with the PATH Export. 

Export Projects. This menu item exports all related project 
data, including project information, checklist items, 
attachments and project files together into one file, 
which can be easily transmitted between users (via 
floppy disk, network or mail system). 

The Search menu provides the functions to filter and 
select both checklist items and attachments (reference and 
project files). 

The Go To menu allows a jump to all the data provided 
by the Global PATH screen, in addition to Critical Success 
Factors and several key processes, from any of the PATH 
screens. 

The Utility menu provides three key communication 
tools: 

Send PATH Feedback. This standardizes the process and 
format used to transmit ideas from the user back to 
designers and maintainers of PATH (explained in detail 
below). 

Send Lessons Learned Report. This standardizes the pro- 
cess and format used to transmit Lessons Learned back 
to Business or Process owners (explained in detail 
below). 

Send Competitive Information. This standardizes the for- 
mat used to transmit Competitive Information back to 
a central source who is responsible for collecting, 
evaluating and disseminating to the appropriate people. 
The Help menu provides on-line content sensitive help for 
using and understand how to use PATH. In addition, 
Who2Call fists help project teams know where to go for 
information. 

Use of the common Project PATH file structure provides 
standard filing convention and information structure for all 
members of a Project Team. Communication of information 
to Project Team members can be exported or linked to other 
geographic locations electronically. This information can 
originate from individual local data bases, from the central 
active file database, from an archieval file of a previous 
project maintained on a central database, or from another 
related project on a local database. Export of files are easily 
made via attachments to standard MSMAIL using existing 
available software. 

All project files created for the current project are shown 
within the Project Performance section of Project PATH. 
The system PATH is structured to keep track of all files 
opened for this specific project and stores these filenames 
and their directory structure in one location for ease of 
retreival. PATH links these files and directories to other 
systems in the Windows environment for use with other 
databases or to communicate to Team members. Those 
tasks/files which have not been closed/completed are auto- 
matically linked to the Closure section of PATH. Completion 
of the task/file is necessary before the project is done/closed 
out. Since PATH keeps account of the open items/issues 
during the project, it is unnecessary to manually keep track 
of the tasks. 

Reference data linking is implemented by storing the 
name of the reference or project specific data file in the 
database. When the file is opened within PATH, the appli- 
cation associated with the data file is launched, and the file 
is loaded. 
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When any given project is to be initiated, copies of the ing procedure performed periodically at the local sites to 

project management program and those portions of the obtain the most current (new) best practices, lessons learned, 

component databases applicable to the project are exported and other project information gathered at the central server's 

or copied from the global database 1 and installed on a local database. For the PATH Project example described herein, 

terminal 3 which will be used for managing the local project. 5 the program updating procedure is implemented through a 

These programs and component databases are primarily in Revision Control procedure executed by a user at a local 

the form of common-format master lists, data files, and terminal. The Revision Control procedure can be initiated 

references. The project management program is first oper- automatically for networked systems whenever a terminal 

ated in a Project Proposal mode (Proposal PATH). The user ^ s on i0 A ^twork with the central server. Alternatively, 

startswithasetoftestlistsloadedwifhthebestcurrentda^ 10 for s ' andalone ten ™ als > " p C s and portable 

. . A , . . 4 r . , computers, project managers at the local sites can install 

then selects, edits and customizes hem to specify the tasks ^ y J si ^ s f ^ 6 ject m mem at 

r D uiremenr m * P< riodic interVals £r0m ***** SeDt &0m the V ' nd ° r ' S 

requiremen s. central office or by downloading through an e-mail or file 

Upon acceptance by the customer, the proposal lists are c *• , *i_ » i «= t n *u 

. r , . K „ . \ „ - : ^mTin * transfer connection to the central office. In all cases, the 

then used m the Proiect Performa nee (Proiect PATH) mode is ~ . . „ A . , , . t , t V. 

, iL , i j i_ *!_ * *j *u r Revision Control procedure checks the current version of the 

where they are tracked by the program to guide the perror- , , \, 4 . , , 4 , , 

c i*. • . ™ / hi j ... program used by the terminal and guides the user through 

mance of the proiect. Reference files are stored upon instal- f, ° , . J , 4 , t ° , . . Jr 

. 4 . r , i , ,. . i j* ii_ the updating procedure to update the local program to the 

most current revision. 

FIG. 4 illustrates the basic data flow for a Revision 
Control procedure implemented in this example of the PATH 

List Management program for a project services vendor. For the purpose of 

In this example of a project management program, the Revision Control, PATH is broken into components and each 

system carries out its project tracking functions essentially component is tracked by a separate version number. The 

by fist management. A basic sequence for management of a 2 5 ma j° r components ^uig the PATH software, PATH Master 

typical list of tasks representing any task component of the Templates (which stores checklist items and reference files), 

overall project is illustrated in FIG. 3. As the project task Mission PATH Databases (which stores reference files), Pole 

component is performed, the tasks on the list are checked by PArH Databases (also stores reference files), and Processes 

the project manager and members of the project team. If a & Tools ( each Process or T ° o1 15 designated as a separate 

task is performed, it is so noted on the list. If a task is 30 com P one nt). In the following description, numbers in paren- 

modified locally, approval is obtained from the customer and theses are in the diagram to reference the sequence of 

the vendor supervisory group and the modified task is ste P s of the Revision Control procedure, 

entered on the list. If a locally modified task is superceded, All current revisions of each PATH program component 

it is deleted from the list and a new task item may be entered. are stored in the global database, labelled in the diagram as 

Checklist items that originate from a Master Template from 35 Central PAm Storage (10). A PATH Setup Utility (11), 

the Global Database are not permitted to be modified or wne n ran in Install mode, is used by each user to create a 

deleted from the list. If a master template item is to be current site-specific version of PATH on their local computer 

superceded, the item is marked "not applicable", then a new &om a current master version stored locally on a hard disk 

item can be added at the discretion of the project manager. or available through a network to the central server. The 

At the completion of the project, new items are reviewed by 40 site-specific installation includes only the components 

the vendor supervisory group. requested by the user for that particular site. 

Periodically, the site-specific project lists and related The PAp H Setup Utility (11), when ran in an Update 
project files are exported from the local terminal 3 to the mode » w* 11 P 011 the user ' s L 0 ^ 1 Computer (12), and auto- 
global database 1 where they can be reviewed and the matically detect old versions of any PATH component as 
progress of the project can be monitored by a project 45 compared to the current master version. The user may select 
supervisor. If the site-specific data includes a result which is whic : h PATH components to update to the most current 
deemed to be a new best practice or lesson learned or new version. 

information, the project manager may send a report to the An example of the Setup/Update utility for updating the 

central office or include a notation with the project data project management program is illustrated via the pre- 

exported to the global database in order to call the attention 50 formatted screen display shown FIG. 5. Each PATH com- 

of the project supervisor to review this new information. ponent is fisted on the screen. Each component has a 

The project data exported or reported to the global data- "Status." The status will set to "Installed" if the component 

base are stored as project-specific files. The project files ^ Stalled on the local computer (12), and is current, 

reviewed by a project supervisor, and any significant new "OLD" means the component is an old revision. A blank 

best practices, lessons learned, or new information are 55 status indicates the component is not installed on the local 

forwarded to or noted for technical review by a mission computer (12). The "Install", "Update" and "Remove" 

review group. If the mission review group determines the check boxes are used to an aclion - ^ "Install" check 

reported practice to be a new best current practice for all box is ^ t0 new components that do not already 

participating projects, the best current data of the global ejdst 011 the local computer (12). The "Update" check box is 

database is updated to reflect the new best current practice 6° used t0 u P<*ate an existing component on the local computer 

and a new revision number is assigned to the project ( 12 ) that te out of date * ^ "Remove" check box will 

management program which includes the corresponding remove an existing component from the local computer (12). 

c j ata Each component also displays a description, current revision 

number, date of current revision and disk space installation 

Revision Control Procedure 65 requirements. 

A central function of the project management system in Projects are performed (13) on the local computer (12), in 

accordance with the present invention is the program updat- the home office or field at the local site. As mentioned 
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previously, project management tasks are performed in the previous version of the program, such as when an on-going 

PATH program example, primarily by list management. The project receives updated information, the new information is 

local computer may be disconnected from the Central PATH merged with the existing project files using a Combine 

Storage (10) such as when the site project manager or fusion. The Combine function combines the two sets of 

members of the project team use a portable computer while 5 projecl data togethef tQ form an updated or Qew versioQ of 

A - * , . . - , . • * * . . the project data. The new project data is a basically a merge 

After the project is performed, or at periodic intervals c , . . . „ , \ , . 4 ct t , * 

while a project of long duration is being performed, the of ltems ' attachmenls and P ro J ect files & ° m l*>th 

site-specific project data (checklists and project files) are project data sets. 

exported to the central server of the project services vendor. 10 Combine function utilizes a "subject code" that is 

At the centxal server the site-specific project data is stored assi d t0 each check]ist item checklist items that come 

in Project Archives (14) The Project Archives thus include fr fa fil , f 

complete histones of all projects that have been performed . , . , . , _ . 
through the stored copies of the projects* programs and data stora g e are assi S ned «" .identifiable series code. Project- 
files. For reviewing data from former projects or modelling specific checklist items, i.e., items generated locally reflect- 
new projects, the central office's project supervisors or 15 m g l° ca l information, may have a different series code or no 
mission review group or any site manager can access the old subject codes. Where two checklist items are similar or 
projects for information to use in new projects. equivalent, but appear in different master templates or 
As a result of performance of a project or report from an different versions of the same master template, the Combine 
on-going project, any valuable Lessons Learned and Process function assigns the merged item the same master subject 
Improvements are transmitted back to central office (10) as 20 co de. Where a checklist item has a project-specific or no 
part of the dynamic process in the invention of sharing the subject codCj it ^ carried over to the ed list 
best practices learned locally with all other projects globally. 

The PATH program includes a pre-formatted utility for When an existing project is to be updated with a new 

reporting Lessons Learned. A sample of the screen display version of a master program or templates, the Combine 

for the Lesson Learned reporting utility is shown in FIG. 6. 25 function is used to update the local project by treating the 

A project team may also report general comments through a existing project lists as one project and the updated master 

pre-formatted utility -for PATH Feedback. A sample of the templates as another projecL TOs ^low any site-specific 

screen display for the PATH I Feedback reporting utility is mformation at the local ject to 5e carried over mt0 the 

shown in FIG. 7. This feedback includes both software , , . , , t , . , ,. . f 

, c n , * * ■ updated program along with the new task lists from the 

improvements and reference file or content improvements. 30 , , , 7 4 

L, , „. „. „ , . updated master templates. 

The central office management star! collects and reviews 

all such feedback (16). From the feedback, new revisions of Referring to the flow diagram in FIG. 8, the Combine 

one or more PATH components may be created (17), and function works as follows. The numbers in parentheses are 

new revision numbers are then assigned to these used in the diagram to reference the sequence of steps of the 

components, which are then transmitted and stored in the 35 procedure. First, all primary project items and attachments 

Central PATH Storage (10). Thus, when the Central PATH are copied to the combined fist (105). Then each item in the 

Storage is next accessed for a new project or to update an secondary project is checked (110), to determine whether the 

on-going project, the new information approved by the i tem ^ a pro ject specific item (in which case a subject code 

management staff is made available in the now current k not assigned) (111). If so, the item and attachments are 

version of the project files. 40 copied t0 the combined list ( U2 ). If the item is not a project 

TTie Export Project feature of the PATH program operates specific item and originated from a master template, then the 

by extracting project data from the local terminal PATH sub ject code for the item is checked to see if the item already 

database into a temporary database. The temporary database exists fmm the ^ ^ (m) Jf ^ Qn{ a gi le ^ 

and all project specific files are compressed into one fi e ^ maintained on lhe ^bined list and the attachments from 

(using PKWare File Compression^ Software)^ one file 45 both items are merged for the combined list (114). If the item 

can be transferred to other users. The Import Project feature ^ nQ{ k common with the ^ items ied QQ the 

operates by uncompressing the exported file and storing the combined list (has a differcnl su5ject code)j men the item 

project data in the local terminal PATH database, and storing and aUachment s are copied to the combined list (115). 
the project specific files into the PATH file storage scheme. 

On the PATH Feedback screen, the user sends in this data 50 The following example demonstrates the application of 

to a central location which accumulates and stores all the tQe Combine function to merge a specified task list of a 

questions and concerns by category of PATH. The user has Project A with a corresponding list from a Project B to obtain 

the option of including anyone else on the distribution of the ■ an updated Project C: 
data by simply clicking on the address portion Everything is 

automatic and requires no prior knowledge of the MAIL 55 Pto cct A 

system. Access to all questions/comments sent are readily — ^ 

retrievable to users having access to the central database. # item Subject Code 

The Lessons Learned Data sheet is provided to the *■ Create Schedule. 101 

selected mission review groups and business managers so \ w^c^ay^ian 103 

that new best practices can be incorporated for future 60 4! Collect Design Record Files. 0 (project specific) 

projects of a similar type. This standard structure makes it Project B 

easy to compare the various projects and to maintain a # sub'ect 

database of experience which is useful to later project teams. * Create ^Schedule^ 101 

Combine Function 2 - Create Budgets. 102 

65 3. Plan Teamwork & Leadership. 249 

When an updated version of the project management 4. Approve Budgeis. 0 (project specific) 
program is installed at a local computer still running the 



03/01/2004, EAST version: 1.4.1 



5,826,252 



11 

-continued 



12 



Project C (Combination of Project A and Project B) 



# Item Subject Code 

1. Create Schedule. 

2. Create Budgets. 

3. Write Quality Plan. 

4. Plan Teamwork & Leadership. 

5. Collect Design Record Files. 

6. Approve Budgets. 



101 
102 
103 
249 

0 (project specific) 
0 (project specific) 



In summary, the overall architecture for the project man- 
agement system of the invention allows the new best infor- 
mation obtained locally to be exported to the central server 
of the services vendor where it is reviewed and can be shared 
globally with all projects of similar type. The Revision 
Control procedure ensures that the new best current infor- 
mation is installed at the local projects when a new project 
is initiated or when an on-going project is updated. The local 
project management program uses the Combine function to 
merge the new information with the existing project infor- 
mation to update the local project task lists. The system uses 
a common project management program and common- 
format files to facilitate the project updating and information 
sharing. 

Although the invention has been described with reference 
to a preferred implementation, it will be appreciated that 
many other applications, variations and modifications 
thereof may be devised in accordance with the principles 
disclosed herein. The claimed aspects of the invention, 
including all such applications, variations and modifications 
deemed to be within the scope and spirit thereof, are defined 
in the following claims. 

We claim: 

1. A system for managing multiple projects of a similar 
type comprising: 

(a) a global project management database for storing data 
for all projects participating in said system, said data- 
base being maintained and updated dynamically with 
best current data representing best current practices 
across all participating projects in said system; 

(b) a plurality of localized computer terminals, each 
having a microprocessor for executing a project man- 
agement program for a participating project associated 
with each said terminal at a respective local site and a 
memory for storing and retrieving data obtained from 
said global project management database for use in the 
project management program at the respective local 
site; 

(c) data access means operative between said global 
project management database and each one of the 
localized computer terminals for periodically importing 
data from said global project management database, 
including best current data across all projects, for use in 
the participating projects, and for periodically export- 
ing data from the participating projects to said global 
project management database; and 

(d) updating means operative in conjunction with said 
global project management database for allowing 
evaluation of said data periodically exported from the 
localized computer terminals of the participating 
projects, determining whether any such data constitutes 
best current practices across all participating projects, 
and updating the best current data of said global project 
management database with data determined to consti- 
tute new best current practices across all participating 
projects, said new best current practices data then being 
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imported by the localized computer terminals from said 
global project management database, 

wherein said updating means and said data access means 
are operative to update the localized computer termi- 
nals with the new best current practices data whenever 
a current version of data maintained at a localized 
computer terminal is outdated as compared to the new 
best current practices data available from said global 
project management database, 

whereby said global project management database is 
updated dynamically with best current data represent- 
ing new best current practices across all participating 
projects of said system, and the localized computer 
terminals import and use the new best current practices 
at the respective participating projects. 

2. The system as described in claim 1, wherein said data 
access means comprises means for importing and exporting 
data by periodic exchange of data stored on data storage 
media between said localized computer terminals and said 
global project management database. 

3. The system as described in claim 1, wherein said data 
access means comprises a network connecting each of said 
localized computer terminals with said global project man- 
agement database for importing and exporting data periodi- 
cally therebetween. 

4. The system as described in claim 1, wherein said 
project management program used by the localized com- 
puter terminals includes means for reporting lessons learned 
or best current practices developed at a participating project 
to said global project management database. 

5. The system as described in claim 1, wherein the project 
management program used by the localized computer ter- 
minals is the same as that maintained with the global project 
management database, and a uniform directory and file 
structure is used by the common project management pro- 
gram. 

6. A system for managing multiple projects of a similar 
type comprising: 

(a) a global project management database for storing data 
for all projects participating in said system, said data- 
base being maintained and updated dynamically with 
best current data representing best current practices 
across all participating projects in said system; 

(b) a plurality of localized computer terminals, each 
having a microprocessor for executing a project man- 
agement program for a participating project associated 
with each said terminal at a respective local site and a 
memory for storing and retrieving data obtained from 
said global project management database for use in the 
project management program at the respective local 
site; 

(c) data access means operative between said global 
project management database and each one of the 
localized computer terminals for periodically importing 
data from said global project management database, 
including best current data across all projects, for use in 
the participating projects, and for periodically export- 
ing data from the participating projects to said global 
project management database; and 

(d) updating means operative in conjunction with said 
global project management database for allowing 
evaluation of said data periodically exported from the 
localized computer terminals of the participating 
projects, determining whether any such data constitutes 
best current practices across all participating projects, 
and updating the best current data of said global project 
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management database with data determined to consti- 
tute new best current practices across all participating 
projects, said new best current practices then being 
imported by the localized computer terminals from said 
global project management database, 

whereby said global project management database is 
updated dynamically with best current data represent- 
ing new best current practices across all Participating 
projects of said system, and the localized computer 
terminals import and use the new best current practices 
at the respective participating projects, and 

wherein the project management program includes means 
for using the data obtained from said global project 
management database in a Project Planning Mode, then 
transferring the data from the Project Planning Mode 
for use in a Project Performance Mode. 

7. A system for managing multiple projects of a similar 
type comprising: 

(a) a global project management database for storing data 
for all projects participating in said system, said data- 
base being maintained and updated dynamically with 
best current data representing best current practices 
across all participating projects in said system; 

(b) a plurality of localized computer terminals, each 
having a microprocessor for executing a project man- 
agement program for a participating project associated 
with each said terminal at a respective local site and a 
memory for storing and retrieving data obtained from 
said global project management database for use in the 
project management program at the respective local 
site; 

(c) data access means operative between said global 
project management database and each one of the 
localized computer terminals for periodically importing 
data from said global project management database, 
including best current data across all projects, for use in 
the participating projects, and for periodically export- 
ing data from the participating projects to said global 
project management database; and 

(d) updating means operative in conjunction with said 
global project management database for allowing 
evaluation of said data periodically exported from the 
localized computer terminals of the participating 
projects, determining whether any such data constitutes 
best current practices across all participating projects, 
and updating the best current data of said global project 
management database with data determined to consti- 
tute new best current practices across all participating 
projects, said new best current practices then being 
imported by the localized computer terminals from said 
global project management database, 

whereby said global project management database is 
updated dynamically with best current data represent- 
ing new best current practices across all participating 
projects of said system, and the localized computer 
terminals import and use the new best current practices 
at the respective participating projects, and 

wherein said data access means includes means for per- 
forming revision control by comparing an existing 
revision number of the project management program at 
a localized computer terminal with a revision number 
assigned to best current data currently imported from 
the global project management database. 

8. The system as described in claim 7, wherein said means 
for performing revision control includes Combine means for 
combining new best current data currently imported from 
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the global project management database with best current 
data maintained at the localized computer terminal for a 
participating project. 

9. The system as described in claim 8, wherein said 
5 Combine means includes means for creating a combined list 

of best current data items by first copying best current data 
items for the participating project to the combined list, then 
comparing each new best current data item imported from 
the global project management database for inclusion on the 
combined list. 

10. The system as described in claim 9, wherein each best 
current data item imported from the global project manage- 
ment database has a unique identifying code so that it is 
distinguishable from best current data items that are entered 
at the localized computer terminal as project specific items, 
and said Combine means operates to merge best current data 
items having unique identifying codes and copying best 
current data items not having unique identifying codes. 

11. A method for managing multiple projects of a similar 
type comprising the steps of: 

(a) maintaining a global project management database for 
storing data for multiple participating projects, said 
database being maintained and updated dynamically 
with best current data representing best current prac- 

25 tices across all participating projects; 

(b) providing a plurality of localized computer terminals, 
each having a microprocessor for executing a project 
management program for a participating project asso- 
ciated with each said terminal at a respective local site 
and a memory for storing and retrieving data obtained 
from said global project management database for use 
in the project management program at the respective 
local site; 

(c) periodically importing data to each one of the local- 
ized computer terminals from said global project man- 
agement database, including best current data across all 
projects, for use in the participating projects, and 
periodically exporting data from the participating 
projects to said global project management database; 

40 and 

(d) evaluating said data periodically exported from the 
localized computer terminals of the participating 
projects, determining whether any such data constitutes 
best current practices across all participating projects, 

45 and updating the best current data of said global project 
management database with data determined to consti- 
tute new best current practices across all participating 
projects, said new best current practices data then being 
imported by the localized computer terminals from said 

50 global project management database whenever a cur- 
rent version of data maintained at a localized computer 
terminal is outdated as compared to the new best 
current practices data available from said global project 
management database, 

55 whereby said global project management database is 
updated dynamically with best current data represent- 
ing new best current practices across all participating 
projects, and the localized computer terminals import 
and use the new best current practices at the respective 

60 participating projects. 

12. The method as described in claim 11, wherein said 
importing and exporting of data is conducted by periodic 
exchange of data stored on data storage media between said 
localized computer terminals and said global project man- 

65 agement database. 

13. The method as described in claim 11, wherein said 
importing and exporting of data is conducted periodically by 
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connection to a network connecting each of said localized 
computer terminals with said global project management 
database. 

14. The method as described in claim 11, wherein said 
periodic exporting data step includes reporting lessons 
learned or best current practices developed at a participating 
project to said global project management database. 

15. The method as described in claim 11, wherein said 
program providing step includes providing the localized 
computer terminals with a common project management 
program having a uniform directory and file structure as 
maintained with the global project management database. 

16. The method as described in claim 11, wherein said 
importing data step includes using the data obtained from 
said global project management database in a Project Plan- 
ning Mode, then transferring the data from the Project 
Planning Mode for use in a Project Performance Mode. 

17. The method as described in claim 11, wherein said 
importing data step includes performing revision control by 
comparing an existing revision number of the project man- 
agement program at a localized computer terminal with a 
revision number assigned to best current data currently 
imported from the global project management database. 
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18. The method as described in claim 17, wherein said 
step of performing revision control includes combining new 
best current data currently imported from the global project 
management database with best current data maintained at 
the localized computer terminal for a participating project. 

19. The method as described in claim 18, wherein said 
combining step includes creating a combined list of best 
current data items by first copying best current data items for 
the participating project to the combined list, then compar- 
ing each new best current data item imported from the global 
project management database for inclusion on the combined 
list. 

20. The method as described in claim 19, wherein import- 
ing data step includes assignment of a unique identifying 
code to each best current data item imported from the global 
project management database so that it is distinguishable 
from best current data items that are entered at the localized 
computer terminal as project specific items, and merging 
together best current data items having unique identifying 
codes and copying best current data items not having said 
unique identifying codes. 
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METHOD AND APPARATUS FOR 
IMPLEMENTING SEARCH AND CHANNEL 
FEATURES IN AN ENTERPRISE-WIDE 
COMPUTER SYSTEM 

RELATED APPLICATIONS 

This application depends from and claims priority to U.S. 
Provisional Patent Application No. 60/200,090, filed Apr. 
27, 2000, which is hereby incorporated by reference. This 
application is cross-referenced to related co-pending U.S. 
application Ser. Nos. 09/845,057 filed Apr. 27, 2001, 09/844, 
715 filed Apr. 27, 2001 and PCT Application Serial No. 
PCT/US01/13842 filed Apr. 27, 2001. 

BACKGROUND 

Many businesses and other enterprises use a variety of 
computer systems that are specially adapted for certain 
purposes. For example, a manufacturing division of an 
enterprise may use one kind of computer system specifically 
designed to handle manufacturing data while the sales 
division of the same enterprise may use another kind of 
system for sales information. The engineering division of the 
enterprise may use an entirely different computer system as 
well. Using different computer systems for different divi- 
sions of an enterprise makes sense because each kind of 
computer system will provide certain strengths that suit that 
division. 

Although different divisions within an enterprise may use 
different computer systems, there are advantages to sharing 
data across an entire enterprise. For example, an individual 
in the sales division may need to know the current inventory 
levels for a product in the manufacturing division to deter- 
mine what price should be set for the product. One solution 
to this problem is to provide hard copies of reports from 
different divisions of an enterprise to certain key individuals 
in the enterprise. This procedure is disadvantageous because 
it can overwhelm an individual with much more information 
than the individual needs and because the data in the hard 
copies of the report can be out of date by the time that the 
individual reviews it. Another solution to this problem is to 
use emulator computers that allow an individual to use a 
single computer to access more than one computer system. 
This procedure is also disadvantageous because the indi- 
vidual is required to learn a new interface and a new 
computer language for each computer system that he is to 
access. Thus, there is a need for an enterprise -wide computer 
system that can connect to a variety of computer systems, 
retrieve data from these systems, and present data to an 
individual in a standardized, easy-to-leam format. 

SUMMARY 

Disclosed herein is an enterprise -wide computer system 
designed to be connected to a variety of different computers 
systems within the enterprise. The computer system uses a 
portal architecture to allow a user to view a wide variety of 
content retrieved from a variety of different computer sys- 
tems. The computer system may also be referred to as a 
portal system. The portal system is configured such that a 
plurality of users can access the system at the same time 
through a computer network such as the Internet. The portal 
system may also be connected to one or more back-end 
databases that correspond to the different computer systems 
within the enterprise. The portal system is scalable because 
many of its components are modular and can be readily 
duplicated as redundant processors. In this manner, small 
enterprises and large enterprises may be accommodated by 
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different versions of the same portal system. In one aspect, 
the portal system acts as a middle -ware program that con- 
verts the data and reports from the variety of back-end 
databases and presents the data to a user in a standardized 

5 format. Data is provided to users by the portal system in a 
format that is readable by a browser program. Thus, by 
allowing a user to use a standard browser program as a user 
interface, the user's learning curve for the portal is greatly 
reduced. In particular, the user will be able to select reports 

1Q and data for viewing by pointing at an item with his mouse 
and selecting a hyperlink." 

In addition to converting data from back-end databases 
into a standardized format for a user, the portal system may 
be configured to ran predefined jobs to process data. These 
jobs are stored within the portal system in a computer 

35 memory device called a repository. These jobs can perform 
a variety of tasks such as retrieving data from -a back-end 
database, preparing a report based upon retrieved data, 
processing data already resident within the portal system, or 
notifying a user when a particular condition occurs within 

20 the portal system. These jobs can be executed on a pre- 
defined schedule or on an ad-hoc basis at the request of a 
user. When a job is executed on a predefined schedule, the 
output report of the job will often be stored in the repository 
so that it can be retrieved at a later time. When a job is 

25 performed on an ad-hoc basis, the output report will gener- 
ally be provided to the user immediately through his browser 
interface. If a job is of particular interest to a user, then the 
portal system allows a user to subscribe to the job. A 
subscription will send a notification to the user whenever the 

30 job is executed by the portal system. The portal system also 
allows a user to configure one or more exception conditions 
for a job that indicate when some element of the output 
report is outside of a predefined range. A user can subscribe 
to job exceptions and thus be notified when these exceptions 

35 occur. 

The portal system presents data to a user in an object 
called a portal page. The portal page is an object arranged in 
a format that is readable by a browser program. The portal 
page is a highly configurable document that may be com- 

40 prised of a plurality of modules called portal objects. Each 
portal object may contain a set of links corresponding to 
output reports, jobs, or other objects stored within the 
repository. Thus, by clicking on one of the links in a portal 
object, the portal system will process the object correspond - 

45 ing to that link. If the link is directed to a job stored within 
the portal system, then clicking on that job will cause the job 
to be executed. If the link is directed to a browsable object 
stored within the repository, then that object will be dis- 
played to the user. A portal page may also include a display 

50 window that can display browsable objects to a user. 
Another feature of the portal page is a dynamically updated 
portal object, A dynamically updated portal object is an 
object that is updated on the user's portal page based upon 
data stored in the portal system. If a dynamically updated 

55 portal object is included within a user's portal page, the user 
may receive the latest information corresponding to that 
object by refreshing his portal page. For example, if the 
dynamically updated portal object is linked to the output 
report of a job, then the portal object will display the latest 

60 version of the output report to the user when the portal page 
is refreshed. A dynamically updated portal object may also 
be hyperlinked to its corresponding object in the portal 
system such that a user may view, edit, or execute the 
corresponding object by clicking on the dynamically 

65 updated portal object at the user interface. 

Each user's portal page may be customized to suit that 
user's specific needs. A user may add or remove portal 



03/01/2004, EAST Version: 1.4.1 



US 6,643 : 

3 

objects from his portal page at his discretion. A user may 
also edit some portal objects in order to add links to reports 
or objects that the user is interested in. Another way in which 
a user can customize his portal page is to add and modify 
"favorites" on the portal page. A user's favorites is a set of 5 
links to objects stored in the repository, on an intranet, or on 
the Internet. These objects may be jobs, reports, or any other 
kind of data. By clicking one of these links, the correspond- 
ing object is presented in the display window. 

The portal system may also be configured to conduct 10 
searches on behalf of a user. The portal system provides the 
ability to search both structured data (databases, XML, 
formatted text, etc.) and unstructured data (HTML files, 
web-based content, PDF files, etc.) at locations inside and 
outside the portal. The portal system 120 also allows the user 15 
to configure the searches so that only certain objects, in 
certain locations are searched. By using these search 
parameters, a user can streamline a search to identify only 
highly relevant data. This increases the efficiency of the 
search and reduces the likelihood of identifying undesired 20 
results. If a user constructs a search that produces particu- 
larly relevant results, then the user may save those search 
parameters as a channel. The user can return to this channel 
at a later date to conduct the same search to see if any new 
objects have been identified. A list of channels stored by a 25 
user may be included in a user's portal page, allowing him 
access to search results by simply clicking on the appropri- 
ate channel link. 

DESCRIPTION OF THE DRAWINGS 30 

FIG. 1 depicts a high level view of the portal system 
connected to a plurality of back-end database and to a 
plurality of users. 

FIG. 2 depicts a lower level view of the portal system 35 
including the various service agents. 

FIG. 3 depicts an example of the hierarchy of categories 
and objects residing in the repository, 

FIG. 4 depicts some of the categories of properties 
associated with jobs stored in the repository. 40 

FIG. 5 depicts some of the properties associated with 
schedules residing in the event server. 

FIG. 6 depicts some of the properties associated with each 
service agent residing in the portal. 45 

FIG. 7 depicts some of the properties associated with a 
repository, an authentication server, and a job server residing 
in the portal. 

FIG. 8 depicts some of the properties associated with a 
search server and a channel residing in the portal. 50 

FIG. 9 depicts some of the categories of properties 
associated with a crawler residing in the knowledge server 
of the portal. 

FIG. 10 depicts a representative example of a portal page 55 
as seen by a user with a browser program. 

FIG. 11 depicts a representative example of an input form 
presented to a user during the execution of a job. 

DETAILED DESCRIFHON 

60 

Disclosed herein is a method and apparatus for imple- 
menting an enterprise-wide portal system. The system is 
designed to connect a plurality of users to the portal system 
so that the users can access and process data that is stored 
therein. The system may also be connected to one or more 65 
back-end databases so that a user can view, and process data 
that is stored therein. In one embodiment of the portal, a 
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variety of back-end databases using different operating 
systems are connected to the portal system. In this manner, 
the portal system allows a user to access data from a wide 
variety of back-end databases with a single computer inter- 
face. Another described aspect uses the portal system as a 
middle -ware program for converting a user's instructions 
into commands to retrieve and process data from the back- 
end databases. Another described aspect uses the portal 
system to display the results of a back-end process to the 
user in a format that can be read by a standard browser 
program. Another described aspect uses the portal system to 
process data that is stored in the portal system and provide 
output reports to a user. The portal system thus provides a 
one-stop interface for accessing, processing, and providing 
a wide variety of data to a plurality of users. In order to 
simplify the access to the computer system, the user inter- 
face may be based upon a standard browser program that is 
capable of reading Hypertext Markup Language (HTML). 
The browser may also be capable of reading other web- 
based programs such as Java, XML, Macromedia Flash, or 
other languages. By using a standardized browser program 
as a user interface to the computer system, the user is 
presented with a familiar format in which a user can point 
and click on hypertext links to navigate through the portal 
system and provide instructions to the portal system. 

FIG. 1 depicts a high-level illustration of one embodiment 
of the portal system 120. In FIG. 1, a plurality of users 100 
are connected to a network interface 105 through a computer 
network 110. The computer network 110 can take many 
forms including a direct connection, a local-area network, an 
enterprise intranet, a wireless network, the Internet, or any 
combination thereof. The network interface 105 is con- 
nected to a portal system 120 through a web client 115. 
Within the portal system 120 are a service broker 125 that 
controls access to the computer system and a plurality of 
service agents 130 that are configured to perform specific 
tasks within the portal system 120. Also connected to the 
portal system 120 are several back-end databases 135, 140, 
145, 150 in which data is stored. It should be noted that FIG. 
1 is a block diagram that represents certain functional 
aspects of the invention as separate blocks. These functional 
blocks may be implemented on separate computer platforms 
or on the same computer platform. 

In FIG. 1, each of the back-end databases 135, 140, 145, 
150 may contain different kinds of data and may use 
different operating system platforms. For example, back-end 
database 135 could be a Unix-based system in which sta- 
tistical process control information about a manufacturing 
facility is stored. Back-end database 140 could be a 
PC-based database in which human resources data 
(employee payroll, headcount, organizational structure, etc.) 
is stored. Back-end database 145 could be an Oracle-based 
system in which sales and inventory information is stored. 
Lastly, back-end database 150 could be a Windows NT 
server in which benefits and pension information is stored. 
Different databases and platforms are sometimes used for 
different groups within an enterprise because each group has 
specialized needs that are best served by their respective 
back-end databases. Using different databases and platforms 
within the same enterprise, however, makes the combination 
and comparison of data from different groups difficult. The 
embodiments disclosed herein address this difficulty by 
using the portal system 120 as a common interface between 
the various back-end databases 135, 140, 145 & 150 and a 
user 100. By using the portal system 120 as a common 
interface, data can be retrieved from the back-end databases 
and presented to the user in a standardized format through 
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the web client 115. For example, a user 100 may request that 
the portal system 120 produce a graph illustrating the 
enterprise's manufacturing yield over the past year. Upon 
receiving the request, the portal system 120 would retrieve 
yield data from manufacturing back-end database 135 and 5 
process that data to generate a bar chart corresponding to the 
user's request. This bar chart would then be presented to the 
user 100 through his browser program. That same user 100 
may also request, during the same session, an update of the 
sales figures for the enterprise for the current month. The 10 
portal system 120 would retrieve sales data from the sales 
back-end database 145, process that data, and generate a 
figure corresponding to the user's request. This data would 
then be presented to the user 100 through his browser 
program. The portal system 120 has the ability to simulta- 35 
neously perform each of these tasks and present this data to 
the user 100 with a single interface. 

FIG. 2 discloses another embodiment of the portal system 
120. In FIG. 2, a plurality of users 100 are connected to a 
network interface 105 through a computer network 110. A 20 
web client 115 is resident on the network interface 105 that 
interfaces the users to a portal system 120. Also illustrated 
in FIG. 2 are three back-end databases 200, 205 and 210 that 
are connected to the portal system 120. Within the portal 
system 120 arc a service broker 125 and a plurality of 25 
service agents: an event server 215, an authentication server 
220, a name server 225, a job server 230, a repository 235, 
and a knowledge server 240 that includes a search server 245 
and a crawl server 250. It should be noted that FIG. 2 is a 
block diagram that represents certain functional aspects of 30 
the portal system 120 as separate blocks. These functional 
blocks may be implemented on'separate computer platforms 
or on the same computer platform. The functions served by 
the service agents of FIG. 2 are summarized below. 

The service broker 125 serves two functions in the portal 35 
system. It controls access to the portal system 120 by users 
100 and controls the disposition of jobs to the service agents 
within the portal system. By controlling the disposition of 
jobs, the service broker 125 ensures that jobs are processed 
in an orderly manner and that none of the service agents 40 
become overloaded. The event server 215 schedules events, 
such as jobs, for processing in the portal system 120 on a 
predefined timetable. The authentication server 220 is used 
to determine if a particular user should be granted access to 
the portal system 120. The permissions and group member- 45 
ships for a particular user are also stored in the authentica- 
tion server 220. The name server 225 is the storage location 
for configuration information about all of the other service 
agents. For example, if the service broker 125 needs to know 
the location of a specific job server 230, then the name server 50 
225 will provide that information to the service broker 125. 
The job server 230 is used to execute jobs in the portal 
system 120. In addition, the job server 230 can retrieve data 
from a back-end database 200, 205 or 210 to be processed 
for a particular job. Each job server 230 may be connected 55 
to at least one back-end database 200, 205 or 210 in order 
to retrieve data therefrom. The job server 230 may also be 
a stand-alone unit which process jobs that do not retrieve 
data from any external sources. The repository 235 is used 
as a storage device for all information that is to be stored in 60 
the portal system. All computer files that are stored in the 
repository 235 are called objects. These objects may include 
HTML files, job output reports, executable job files (SQL, 
etc.), image files, etc. Objects that are stored in the reposi- 
tory 235 are arranged in a hierarchy called categories. 65 
Within each category, both objects and subcategories may be 
stored. Categories are thus organized in a tree system much 
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like the file system on a standard computer In addition, each 
object in the repository may include more than one version. 
Versioning can be used to accomplish a variety of objectives 
including setting multiple security levels for different ver- 
sions of an object, and allowing a user to see a modification 
history of an object. The knowledge server 250 provides the 
search and channel functions for the portal system 120. The 
knowledge server 250 is comprised of two components: a 
search server 245 and a crawl server 250. The crawl server 
250 uses one or more crawlers to analyze and index specific 
information that is stored in the repository 235, a company 
intranet, or the Internet. A crawler can be configured to 
search only in certain locations in the repository 235, a 
company intranet, or the Internet for information to be 
indexed. The indices produced by the crawl server 245 are 
stored in the knowledge server 240 in files called informa- 
tion sources. Depending upon the settings of the crawl server 
250, an information source will contain an index of objects 
found both within the portal system (i.e. in the repository 
235), or outside the portal system (i.e. on an intranet or the 
Internet). The crawl server 250 is capable of indexing 
structured and unstructured data. The search server 245 uses 
the information sources produced by the crawl server 250 to 
conduct searches on behalf of a user. Because the informa- 
tion sources will generally correspond to specialized topics, 
a user may increase the efficiency of a search by selecting 
only those information sources that are relevant to his 
search. The portal system 120 can include redundant service 
agents for processing user requests. In this manner, the 
portal system 120 is scalable to handle both a small enter- 
prise with a small number of users and a large enterprise 
with many redundant service agents for processing requests 
from thousands of users. 

One aspect of the portal system 120 utilizes the various 
service agents to process jobs for the benefit of users. Many 
of these jobs can retrieve data from the back-end databases 
200, 205 & 210 and process that data to generate an output 
report. Jobs may also be used to process data that is resident 
within the portal system 120. For example, jobs could 
include a weekly report on manufacturing statistics for the 
enterprise, or a report describing the current status of the 
enterprises' accounts receivable. Because these jobs utilize 
data that is retrieved directly from the back-end databases, 
the output reports generated by these jobs reflect an up-to- 
the-minute status of the corresponding aspect of the enter- 
prise. Generally, a job is stored in the repository 235 of the 
portal system 120. When a job is to be executed, it is 
retrieved from the repository and sent to an appropriate job 
server 230 for processing. At the job server 230, the job is 
executed. Sometimes, a job will require that certain data be 
retrieved from a back-end database 200, 205 or 210. In many 
instances, jobs are written in SQL language so as to facilitate 
the retrieval of data from the back-end databases. After data 
is retrieved from a back-end database 200, 205 or 210 and 
processed by the job, the job will produce an output report. 
This output report may be stored in the repository 235 after 
the job is complete. An output report may also be provided 
directly to a user 100 through the web client 115. 

Jobs may be processed by the portal system 120 on either 
an ad-hoc basis or on predetermined schedule. Jobs pro- 
cessed on an ad-hoc basis are usually executed at the request 
of a user 100 connected to the portal system 120. When a job 
is processed on ad-hoc basis, the job is first retrieved from 
the repository 235 and sent to an appropriate job server 230 
for processing. After processing, the output report will be 
transmitted to the user 100 via the web client 115. The output 
report may be stored in the repository 235 even though it 
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was processed on an ad-hoc basis. Jobs may also be con- portal system 120. Because a portal page is presented to a 

figured to run on a predetermined schedule. Information user 100 through the web client 115, the data must be 

describing these schedules is stored in the event server 215. arranged in a format that is readable by a user's browser 

When configuring a job to run on a predetermined schedule, program. In FIG. 10, a wide variety of data is presented to 

the job must first be associated with a schedule in the event 5 a user 100 in the form of portal objects. A portal object is a 

server 215. If there is not a pre-existing schedule in the event modularized collection of links, graphics, or other data that 

server 215 that matches the timetable for the job, then a new may be presented to the user in a portal page 1000. The 

schedule can be created in the event server 215. When the portal objects depicted in FIG. 10 include broadcast mes- 

designated time for a schedule arrives, the event server 215 sa g es 1005, a company billboard 1010, a user's customized 

generates a list of the jobs that have been associated with 1Q bookmarks 1015, an exceptions dashboard 1020, and a 

that schedule and sends that list of jobs to the service broker syndicated content object 1030. Also present in the portal 

125 for execution. The service broker 125 then dispatches pa g e 1000 of FIG. 10 is a display window 1025. A display 

the jobs for execution on the appropriate job server 230. The window 1025 is a window in the portal page 1000 in which 

output results for each of these jobs are then sent to the a user 100 may view browsable objects. The display window 

repository 235 for storage. 15 1025 may display a variety of objects from the repository 

Another aspect of the portal system 120 relates to sub- (output reports, HTML objects, dashboards, etc.) or pages 

scriptions, A user may subscribe to a particular object or from the Internet. A user can select content to be displayed 

category that is stored in the repository 235. Thus, if an in the display window 1025 by selecting an appropriate link 

object or category within the repository 235 is modified, in the portal page 1000. The portal objects present in a user's 

then all of the subscribing users are notified of the change. 20 portal page are highly configurable so that a user may 

Users may also subscribe to a job such that when a job is customize his portal page 1000 to suit his particular needs, 

executed, the user will be notified. If a user subscribes to a Some portal objects can be configured such that they must 

job, then he will be notified of its execution regardless of appear on every user's portal page 1000. These portal object 

whether the job was run on a pre-determined schedule, or on are called mandatory portal objects. Mandatory portal 

an ad-hoc basis. Users may receive notification in a variety 25 objects may be used to ensure that all users 100 of the portal 

of ways including e-mail or notification on the user's portal system 120 are presented with certain content whenever they 

page. The portal system may also be configured to provide use the portal system 120. An example of such a mandatory 

a copy of the job's output report as an attachment to the portal object is the broadcast messages portal object 1005 in 

notification e-mail or as an automatic update to a user's FIG. 10. Portal objects may also be configured such that a 

portal page. 30 user 100 may remove the portal object from his portal page, 

Another aspect of the portal system 120 relates to the use but cannot modify the content of the portal object. An 
of exceptions. An exception is a condition that is tied to the example of this kind of portal object is the company bill- 
results of a job. An exception occurs when the output report board portal object 1010 of FIG. 10. In FIG. 10, it can be 
of a job includes information that is outside of a predeter- seen that a user 100 may remove the company billboard 
mined range. Any number of exception conditions can be 35 portal object 1010 by clicking the "X" icon 1008 in the 
configured for a job. However, if any of them indicate that upper right-hand corner of the object. It can also be seen that 
an exception condition exists, then the entire job will indi- the user does not have the ability to modify the content of the 
cate that an exception condition exists. Only certain jobs company billboard 1010 because an "EDIT" icon is not 
within the portal system 120 can be configured to indicate an present in the upper right-hand corner of the object. Portal 
exception condition. Users can subscribe to exceptions in 40 objects may also be configured such that a user can both 
much the same way that they subscribe to a particular job. modify the content of the object, and remove the portal 
Thus, if the execution of a job produces an exception object from his portal page 1000. An example of this kind of 
condition, then all of the subscribing users will be notified portal object is the "My Bookmarks" portal object 1015 of 
of the exception condition. Notification of exception condi- FIG. 10. In FIG. 10, it can be seen that a user 100 can 
tions may also occur through e-mail or a user's portal page. 45 remove the "My Bookmarks" portal object 1015 in its 
A user may configure his portal page to provide a dynami- entirety by clicking the "X" icon 1008 in the upper right- 
cally updated portal page which displays the status of an hand corner of the object. It can also be seen that the user can 
exception condition. This is called an exception dashboard. modify the content of the "My Bookmarks" portal object 

Yet another aspect of the portal system 120 relates to the 1015 by clicking either of the "EDIT" icon in the upper 

use of channels. A channel is an abstract of a search, which 50 right-hand corner of the object or the "New Bookmark" link 

was constructed by a user, that has been stored in the at the bottom of the object. Thus, a user 100 can customize 

repository for processing at a later date. Generally, a channel the content of his personal portal page 1000, by adding or 

is a search that produces a set of highly relevant results for removing certain portal objects or by modifying the content 

a user. A user can update the channel at any time to see if any of certain portal objects. 

other highly relevant documents have become available. The 55 Another aspect of the portal page 1000 is an exception 
parameters for constructing the search are highly config- dashboard 1020. The exception dashboard is fully config- 
urable by the user, thus allowing him to construct a very urable by a user 100, but may only be used to indicate when 
efficient search. In particular, the channel can be configured certain exception conditions have been met. In FIG. 10, the 
to search limited areas of the repository 235; a company's exception dashboard 1020 is configured to display a traffic 
intranet, and the Internet for new information. A user may 60 light that is green when no exceptions are present and red 
share his channels with other users such that they can when exceptions have been found. A user may add more 
incorporate the channels into their portal pages. A user's than one indicator to the exception dashboard, such that 
channels may be stored in the repository 235 with the user's there is a corresponding indicator for each exception con- 
other portal page data. dition that he has subscribed to. 

FIG. 10 depicts a representative example of another 65 A user 100 may also customize his personal portal page 

aspect of the portal system 120 called a portal page 1000. A 1000 by using favorites and channels. If a user 100 identifies 

portal page presents data to a user when he logs into the a certain object in the repository 235 that is particularly 
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relevant to him, then that user may add the object to his 
Favorites. When an object is added to a user's favorites, a 
link corresponding to that object is added to that user's list 
of favorites. A user may view a list of his favorite objects by 
selecting the "Favorite Items" link 1075 in his personal 5 
portal window 1001. The user 100 may then view any of the 
listed objects in the display window 1025 by clicking on a 
corresponding link. A user 100 may also create a list of 
favorite categories in the repository by using the favorite 
categories link 1080 in his personal portal page 1000. In 30 
addition, a user may create a list of favorite channels by 
using the "my channels" link 1085 in his personal portal 
page. 

FIG. 11 depicts a representative example of another aspect 
of the portal system 120 called a Form. Forms allow a user 15 
100 to provide input to a job while a job server 230 is 
executing the job. Because a form is presented to a user 100 
through the user's browser interface, the form should be in 
a format that can be read by a standard browser program. 
Languages, which can be used to create forms, include 20 
HTML, Java, Macromedia Flash and XML. In FIG. 11, a 
user 100 is presented with four input fields which must be 
provided to the job before it can be executed: i) a sales 
region option 1100, ii) a quarter option 1105, iii) a chart style 
option 1110, and iv) a dimensions option 1115. The sales 25 
region option 1100 and the chart style option 1110 are 
configured as drop-down menus from which a user may 
select. The quarter option 1105 and the dimensions option 
1115 are configured as radio buttons from which a user may 
select. A form may utilize a wide variety of other mecha- 30 
nisms to provide input to a job such as a blank text field or 
an image with selectable fields. Many different input 
mechanisms, which are known in the art of browser lan- 
guage programming, may be utilized here. After a user 100 
has selected values corresponding to each of the input fields, 35 
the user 100 submits these values to the job server 230. In 
FIG. 11, this may be accomplished by pressing the "RUN" 
button 1120 at the bottom of the form. A user 100 may also 
reset the input values that have been selected to the form's 
default values by pressing the "RESET' button 1125 at the 40 
bottom of the form. A user 100 can also save certain input 
settings as the user's default values by selecting the "SAVE 
AS MY DEFAULTS" option 1130 in FIG. 11. When a user 
100 saves certain input values as default values, these 
default values are stored with the user's profile in the portal 45 
system 120. Thus, if the form is presented to the same user 
at a later time, the form will utilize the user's default values 
instead of the system default values. Each job in the reposi- 
tory 235 may be associated with one or more forms depend- 
ing upon how much input is to be provided by the user 100. 50 
The files corresponding to each form are stored in the 
repository 235. 
The Service Agents 

As stated above, the service broker 125 controls access to 
the portal system 120 by a particular user 100. The service 55 
broker 125 also provides session management services for 
users, and acts as a gateway to the other service agents 
within the portal system 120. The service broker 125 dis- 
patches user requests to an appropriate service agent with 
the help of the name server 225. For example, when a client 60 
requests to see files that are stored on the repository 235, the 
service broker 125 will first consult name server 225 to 
determine the location of the repository 235, and then 
dispatch the request to that location. If the portal system 120 
is configured to include redundant service agents, then the 65 
service broker 125 will distribute requests to those service 
agents in a round-robin manner. Each portal system 120 will 
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have only one name server 225 and one repository 235, but 
may have multiple service brokers 125. 

The service broker 125 provides location transparency so 
that users 100 are unaware of the actual location of the 
back-end servers 200, 205 & 210 or the service agents 
within the portal system 120. Accordingly, a service agent or 
back-end database 200, 205 & 210 may be moved from one 
machine to another (possibly for performance reasons) with- 
out affecting the user's interface. The user only needs to log 
in to the correct service broker 125 in order to have access 
to all of the features of the portal system 120. This greatly 
simplifies the login procedure and the browser interface for 
a user. The service broker 125 also distributes work evenly 
among the service agents that support identical services. For 
example, if two job servers 230 provide the same services, 
then one service broker 125 will dispatch work in balanced 
amounts between them. This round-robin load balancing 
improves performance since two machines can process job 
server 230 requests in parallel. Replication of a service agent 
also helps ensure fault tolerance. If two different job servers 
230 provide identical services and one of them is not 
available, then the portal system 120 will continue to operate 
properly, as the service broker 125 dispatches requests only 
to the currently operational job servers 230. Of course, if all 
of the job servers 230 are non-operational, then job server 
requests will fail. 

The name server 225 offers a directory lookup and ini- 
tialization service for the other service agents installed in the 
portal system 120. The name server 225 also manages 
configuration information about the installed service agents. 
Each portal system 120 will have only one name server 225. 
Accordingly, it is useful to think of a portal domain 120 as 
the entity managed by a single name server 225. The name 
server 225 stores metadata about the service agents in a 
Relational Database Management System (RDBMS). As 
part of the installation process, the portal stores the RDBMS 
connectivity information for the name server 225 in a file 
stored in the repository 235. Each service agent in the portal 
system 120 must contact the name server 225 to acquire its 
configuration information during startup. Accordingly, the 
name server 225 should be started before starting any of the 
other service agents in the portal system 120. The name 
server 225 also maintains a configuration administrator 
account, which allows an Administrator to manage configu- 
ration data about the service agents. 

FIG, 3 depicts a representative embodiment of the reposi- 
tory 235 of the portal system 120. The repository 235 is a 
computer memory storage device within the portal system 
120. In FIG. 3, a variety of computer files, known as objects 
300, are stored in the repository 235. Each of the objects 300 
is assigned to a specific Category or Subcategory 305, 310, 
315 within the repository 235. Categories and Subcategories 
305, 310, 315 in the repository 235 are similar to file system 
directories or folders. Each category, subcategory, and 
object is defined with a set of properties. These properties 
include the name of the user who owns the object or 
category as well as permissions for the object or category. 
The permissions define which users can access a category or 
object. This is especially important if certain objects contain 
confidential information that only a few users should see. 
Accordingly, a user or administrator can structure the cat- 
egories such that users can find information in an intuitive 
manner. The categories can also be arranged in a manner that 
efficiently implements security measures for sensitive data. 

An object can be any kind of computer file, including the 
following: 1) Ordinary Files — such as text documents, 
spreadsheets, presentation graphics, HTML files and other 
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documents and executables from general office applications; subscribe to a particular job and receive job output when a 

2) jobs — executable program files from applications such as job server 230 has executed the job. The distribution/ 

Brio. Report™, Oracle Reports, SAP Reports, etc.; 3) notification service notifies a user when relevant events 

Categories — user-defined groups of objects similar to file occur such as completion of a job or identification of a 

system directories or folders; 4) External links— a file which 5 particular exception. The event server 215 provides three 

encapsulates an Internet URL as well as metadata describing of notifications to users: 1) Report Completion— a 

the link; and 5 channels— software 'abstracts' of searches user 15 not,fied when an S ^ R Program or other job is 

that can be readily fine-tuned by selecting documents from executed, creating ;a new ^version of the job ^output; a user can 

current search results. Each object is assigned a property subsenbe to either scheduled or unscheduled jobs; 2) 

ii -i » 1T iir * a ~k mtk m r* * . / u ./ „ Changed Content in a category — a user is notified when the 

called a MIME type. A MIME type is the Multipurpose 10 . & t c & , \ , , -x XT 

w *. . - * j . § * contents of a category or subcategory changes; and 3) New 

Internet Mail Extension associated with an object. Versions of an obj^t-a user is notified when a new version 

Essentially, the MIME type describes the format of the data of an object is stQred in lhe repository or when an o5ject is 

on the object. The MIME type identifies which application upda ted. Notifications are provided to users in a variety or 

or job server 230 should be used to open an object. Each ways including e-mail, a link on the user's browser 

object placed in the repository for storage will be assigned 15 interface, or an icon that appears in the user's browser 

a single MIME type. interface. 

To provide for system security an authentication server Another tool for personalizing a user's portal page is the 
220 is provided. The authentication server 220 is responsible knowledge server 240, which is an optional component that 
for authenticating users who connect to the various service adds * search' features to the portal system 120. The knowl- 
agents in the portal system 120. For example, when a user 20 edge server 240 provides full text searching and concept 
100 logs into the portal system 120, the authentication server matching for documents located on Internet, intranet, and 
220 checks the user's credentials and either allows or portal sites. The knowledge server 240 is configured to 
disallows the user to connect. In addition, the authentication conduct searches upon both structured data and unstructured 
server 220 identifies all of the properties and group mem- data. Structured data is data that is stored in a format that 
berships assigned to a particular user 100. Some of the 25 facilitates processing by a computer such as databases or 
properties that can be associated with a user 100 include a structured filing systems like the repository 235. Unstruc- 
username, password, e-mail address, and permissions. The tured data includes information that is arranged in a format 
permissions associated with a user 100 define the ability of designed for review by humans such as news articles, press 
the user to read, write and execute objects stored in the releases, or any documents posted on the Internet to be read 
repository 235. A Group is used to define permissions for a 30 by humans. The knowledge server 240 can process struc- 
sctof users, rather than individual users. Accordingly, all the tured and unstructured data from a variety of locations 
members of a particular group will be given similar permis- including the Internet; a company's intranet, and the portal 
sions for a set of objects. The authentication server 220 may repository 235. By using concept ranking algorithms and 
be a server integrated into the portal system 120, or it may processes, the knowledge server 240 can qualitatively ana- 
be an external system that is electrically connected to the 35 lyze structured and unstructured data and present only those 
portal system 120. An external authentication server 220 is items (structured or unstructured) which are most relevant to 
useful when an external system already exists that defines a the user's search request. The documents that can be 
set of users 100, passwords, and group memberships. Com- searched by the knowledge server 240 include HTML 
munication between an external authentication server 220 documents, Microsoft documents (such as MS Word), PDF 
and a portal system 120 may be established by using a LDAP 40 files, text files, Brio.Query™ data files, and many others, 
driver. The knowledge server 240 has two components: a search 

By providing a job server 230, the portal system 120 server 245 and a crawl server 250. Each portal system 120 

enables a plurality of users to execute common jobs and to supports a single search server 245 and a single crawl server 

access the output reports of those jobs with a browser 250 that communicate with each other. These components 

program interface. The job server 230 executes external 45 are interrelated and cannot function without each other, 

programs, such as SQR programs, in the portal system 120. The crawl server 250 downloads documents from 

FIG. 2 illustrates that the job server 230 is electrically Internet, intranet, and portal sites and indexes them into a 

connected to the service broker 125, the repository 235 and database called an information source. Documents must be 

at least one back-end database 200, 205 or 210. When a user indexed into information sources before they can be 

100 transmits a request to the portal system 120 to execute 50 retrieved by a search. Crawlers, which are crawl server 

a particular job, the job is sent from the repository 235 to the agents, can navigate the portal, an intranet, and the Internet, 

job server 230 for execution. The job server executes the job according to certain predefined crawler properties. When a 

and returns the resulting job output to the user 100. In crawler begins executing, it starts at the first URL and 

addition, the job server 230 stores job output in the reposi- downloads the document. The crawler determines whether 

tory 235 as an object. The job server 230 can be configured 55 the document should be indexed based on the crawler 

to execute a variety of enterprise applications such as SQR properties and if so, it parses the document. If the document 

Server and Oracle Reports. Furthermore, a plurality of job is an HTML file, the crawler will follow hyperlinks to other 

servers 230 can be installed in a portal system 120 to allow documents and download them. A crawler can be configured 

parallel execution of job requests. By storing the output to gather documents from multiple URLs. If it is desired to 

reports from job servers 230 as an object in the repository 60 use the same crawler properties for several Web sites, then 

235, multiple users 100 can utilize dynamic open links to an administrator can create a single crawler to crawl these 

these objects within their personalized portal pages. Web sites. For example, it might be desirable to use a single 

The event server 215 provides three services in the porta! crawler to index a number of news sites and update the same 

system 120: scheduling services, subscription services, and information source, News, daily at 6:00 a.m. Conversely, if 

distribution/notification services. The scheduling service 65 one wants to use different crawler Properties for different 

dispatches pre-scheduled jobs for execution by one of the Web sites, then separate crawlers may be created to index the 

job servers 230. The subscription service allows a user to Web sites. 
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The crawl server shall be running in order for the crawlers 
to execute. If the crawl server is shut down, then all crawlers 
that are in progress will stop and crawlers scheduled in the 
future will not execute. Crawlers that are run interactively do 
not interfere with crawlers that are running based on a 5 
schedule. Hence, it is possible (though not useful) to run a 
crawler interactively while it is executing based on its 
schedule. The crawl server 250 can index sites that are 
accessible through proxy server or sites that require authen- 
tication. The crawl server 250 may include more than one 10 
crawler. The document references and other metadata iden- 
tified by a crawler are stored in information sources. Each 
crawler can be configured with certain parameters to control 
which documents to index into information sources. 

The search server 245 manages full text searching of 15 
documents that have been indexed into information sources 
by the crawl server 250. In one embodiment, the search 
server uses a proprietary search engine which is commer- 
cially available. The search server 245 may be configured to 
perform searches constructed by a user on an ad-hoc basis or 20 
to perform searches on a predefined schedule. These search 
results, particularly those of scheduled predefined searches, 
may be presented to the user on his or her personalized 
portal page or though portal objects. 

More than one information source may be configured in 25 
the portal. If there is a reasonable partitioning of the data on 
the repository 235, it may be desirable to maintain multiple 
information sources corresponding to each of these parti- 
tions. Indexing documents into separate information sources 
can help users 100 narrow searches to a particular informa- 30 
tion source to get more precise results. When structuring a 
search, choosing to use only information sources that con- 
tain useful information can eliminate extraneous documents. 
The best number of information sources to be searched will 
depend on how much precision and flexibility the user wants 35 
in constructing his searches. Too many information sources 
will clutter the interface and may lead a user to simply select 
all information sources, negating the purpose of having 
separated them. An administrator can set information 
sources to remove old documents after a specified amount of 40 
time, or move them into a different information source. 

The portal system 120 provides channels through which 
data can be dynamically provided to the user's personal 
portal pages or dashboards. The portal system 120 allows a 
user 100 to organize content from the search server's infor- 45 
mation sources and the repository 235 in channels. A chan- 
nel is a vehicle for organizing search results. A user can 
create and maintain channels for private use. For example, 
a user might search the company intranet and the Internet 
about the fishing industry in the Pacific Rim, then create a 50 
private channel called Fishing: Pacific Rim that will contain 
the query options specified in the search. Fishing: Pacific 
Rim will then appear on the left frame of this user's Personal 
tab and each time the user clicks on Pacific Rim, the web 
client runs the search and surfaces the results for that 55 
channel. Should the user 100 want to, she can retrain the 
channel to surface only results about fishing in Vietnam and 
call the retrained channel Fishing: Vietnam. Users 100 with 
write permissions to a category can publish a channel that 
will reside in that category. Users 100 must have read 60 
permissions to the channels they are publishing. For 
example, a user 100 who is a sales manager can publish an 
HR Forms channel in a sales category to which she has write 
permission. 

The Properties of the Portal System 65 

The characteristics and settings of the portal system 120 
are defined by using Properties. Properties describe the 
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characteristics and parameters of the service agents, jobs, 
schedules, and objects stored in the repository 235. The 
properties associated with each of these items are stored in 
a Relational Database. The Relational Database is adminis- 
tered by a Relational Database Management System 
(RDBMS). The properties associated with the different 
aspects of the portal system 120 are described below. 

An executable program and its associated files stored in 
the repository 235 are known as a job. Atypical example of 
a job is any kind report program, including SQR programs 
and other report applications. A job includes all of the 
information needed by a properly configured job server 230 
to execute a specific report or program. There are two kinds 
of jobs which may be processed by the portal system 120: 
SQR jobs and non-SQR jobs. An SQR job is a report or 
program that is written in Structured Query Language along 
with its associated files. A non-SQR job uses an application 
other than SQR such as Brio.Report™. Such a job comprises 
the report or program to be executed (for example, an Oracle 
report or a Crystal report), the script, batch file, or execut- 
able used to run the report or program, and any associated 
files (for example, GIF files, Include files, and so on). An 
SQR job may be either secure or nonsecure. A SQR program 
is secure if it uses the SECURITY command. 

FIG. 4 depicts the hierarchical arrangement of the prop- 
erties associated with a job. The properties associated with 
each job are stored in a relational database including the 
following groups: General Properties 400, Advanced Prop- 
erties 405, Associated Object 410, ASK Properties 415, 
INPUT Properties 420, Output Properties 425, Format Prop- 
erties 430, Options Properties 435, and Associated Forms 
440. The General Properties 400 associated with a job 
include the name of the job, a brief description of what the 
job does, a user 100 who is identified as the owner of the job, 
an expiration date for the job, an auto-delete flag, the group 
to which the job has been assigned, and the keywords 
associated with the job. The user 100 that is identified as the 
owner of the job will generally have fall permissions to edit 
and delete the job. The expiration date property is used to 
automatically delete the job after a specified period of time. 
The group property gives members of the assigned group 
permissions to access or modify the job. The keywords are 
used to make the job easier to find by a user 100 using the 
search feature of the portal. 

The Advanced Properties 405 associated with a job 
include the MIME type, the security mode flag, the rating of 
the job, a browsable flag, an exception flag, a background 
mode flag, a prompt-for-database-login flag, and permis- 
sions. The MIME type property indicates which program is 
used to open the job. The security mode flag indicates 
whether the job has been configured as a secure SQR job. 
The rating indicates whether the priority of the job output is 
Normal or High. The browsable flag indicates whether a user 
100 can see the job by using the browser user interface. The 
exception flag indicates whether the job can report Excep- 
tions. Exceptions are conditions that appear in the output of 
a job that require some intervention or threshold to monitor. 
Users 100 have the option to subscribe to an exception 
associated with a job. If a user 100 subscribes to an 
exception, then he/she will be notified, by e-mail or the 
exceptions dashboard, when an exception occurs during the 
processing of the job. The background mode flag indicates 
that the job is to be executed in the background, thus 
allowing a user 100 to perform other tasks while waiting for 
the job to complete. The prompt-for-database-login flag 
indicates that the user 100 will be prompted for a back-end 
database usemame and password when the job is executed. 
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Lastly, the permissions indicate the kind of access to be the job, and a compile flag indicating whether the job is to 

given to the owner, the assigned group, or any other user be compiled immediately or at a later time. 

100. The Form Properties 440 define the Forms which are used 

The Associated Objects properties 410 identify a list of during run-time to collect input for the job. A form can 

objects or files which are needed by the job to be executed 5 consist of a simple HTML page that contains a form, or it 

correctly. The Associated Objects include files required by can be a complex HTML page that invokes JavaScript or an 

the job at compile time, files required by the job at run time, applet. The form can also be a customized Web-based 

and files required by the job when generating report output parameter collection form that has been developed by the 

formats. enterprise for use in its jobs. When a form has been 

The ASK Properties 415 are used only for SQR jobs. 10 associated with a particular job, the form is stored in the 

These properties are used to prompt a user 100 to provide repository 235. A user 100 can assign a form from the file 

input at the time that a job is compiled. The ASK Properties system, or one that is already in the repository. Alternatively, 

can be provided to the job in several different ways: user a user 100 can choose to have the web client automatically 

input, command-line arguments, or entries in an associated generate a default Form. The form properties 440 include an 

ASK file. An ASK property may include either static param- is HTML Parameter Collection Form, a list of Files Required 

eters or dynamic parameters. With a static parameter, the by the HTML Form, a Show Parameter List when Running 

web client form contains a blank field where the user either the job flag, and a Save As My Defaults flag. The HTML 

types in a value or accepts the default. With a dynamic Parameter Collection Form describes the name of the form 

parameter, the web client form contains a drop-down list of to be assigned to a particular job. The list of Files Required 

values obtained from the back-end database. The user 100 20 by the HTML Form describes the supporting files, such as 

chooses one of these values by using the browser program. GIF images, used by the form for data collection. The Show 

Each ASK property will have several fields 445 including a Parameter List when Running the job flag is used to enable 

prompt to be provided to the user, the default value of the a drop-down list of existing parameter lists when the user 

input parameter, the name of a table to be used for a dynamic 100 is preparing to execute the job. The Save As My 

parameter, and a column name. 25 Defaults flag is used to indicate that a checkbox should be 

The INPUT Properties 420 are used to provide input to a provided to the user 100 when the job executes asking 

job when it is executed. An INPUT property may include him/her if the input settings should be saved as default 

text-field parameters, static-choice parameters or dynamic- values. 

choice parameters. For text-field parameters, the web client FIG. 5 illustrates the properties associated with a schedule 

form presents a text entry field with a default value. The user 30 stored in the event server. Jobs that are created in the portal 

100 may either type in a new value or accept the default system 120 can be set up to run on a predetermined schedule, 

entry. For a static-choice parameter, the web client form To schedule a job, it must be associated with a Time Event 

presents, at the time of job execution, a drop-down list, or a 505, a Parameter List 515, and a Schedule 500. Time events 

group of radio buttons showing a selection of values that has 505 define the timetable for running a job. Because time 

been assigned by the owner of the job. For a dynamic-choice 35 events are not necessarily associated with a particular job, a 

parameter, the web client form presents, at the time of job user 100 can utilize each time event 505 to schedule multiple 

execution, a drop-down list, or a group of radio buttons jobs. Typically, several standard time events 505 will be 

showing a selection of values obtained from the back-end present in the portal system 120. This allows a user 100 to 

database. The user 100 chooses one of these values at the select a time event 505 which best matches his needs. Each 

time of job execution by using the browser program. Each 40 time event 505 will have several properties 515 associated 

INPUT property will have several fields 450 including a with it including a brief description, a creation date, an 

prompt to be provided to the user, the type of input to be owner, a group, permissions, a start date and time, and a 

provided, and the default value of the input parameter, repeat interval. Parameter lists 510 define the compile- time 

The Output Properties 425 define the parameters which and run-time values necessary to execute a job. Each Param- 

are to be associated with the output files generated by a job. 45 eter List 510 will have several properties 520 associated 

Some of these parameters include an output-file displayable with it including: a parameter list name, a brief description, 

flag, an auto-delete flag, a propagate permissions flag, and a job name, ASK Properties 415, INPUT Properties 420, an 

permissions. The output-file displayable flag indicates assigned owner and group, and permissions. Many of these 

whether the web client can display the content of an HTML properties are interrelated to the job properties illustrated in 

output file instead of merely display a link to it. The 50 FIG. 4 and previously discussed, 

auto-delete flag indicates whether the output file is to be FIG. 6 illustrates some of the common properties associ- 
automatically deleted after a pre -determined period of time. ated with service agents in the portal system 120. In FIG. 6, 
The propagate permissions Hag indicates whether the same a list of installed service agents 600 is maintained in the 
permissions set for the job should be assigned to the output Relational Database in the portal system 120. Several prop- 
file. The permissions define the ability of certain users to 55 erties are associated with each of the installed service agents, 
access, edit or delete the output file. including the Name of the service agent 605, the Type of the 

The Format Properties 430 define the format for the SQR service agent 610, the Host where the service agent resides 

job output. Format options include Hewlett Packard laser jet 615, the Database Type 620 associated with service agent, 

printer (.hp), line printer (.lp), postscript (.ps), comma- and the Database Server 625 associated with the service 

separated value (.csv), Adobe Acrobat (.pdf), and Brio- 60 agent. As seen in FIG. 6, a list of the different service agent 

.Query™ data (.bqd). HTML output is always generated. If types 630 is maintained in the Relational Database in the 

the job is a secure SQR job, then only HTML output type is portal system 120. Thus, when a service agent is installed on 

available. the portal system 120, the administrator must add a new 

The Options Properties 435 define run-time parameters service agent type or select from the list of available service 

associated with a job. The options properties 435 include the 65 agent Types 630. Also depicted in FIG. 6 is a list of the 

username and password needed to run the job, the available service agent Hosts 635 which are resident on the 

command-line flags for the job, the application needed to run portal system 120. Again, when a new service agent is 
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installed on the portal system 120, the administrator must 
add a new service agent Host or select from the list of 
available service agent Hosts 635. More than one service 
agent type can be installed on a particular service agent host. 
For example, an authentication server 220 and a name server 5 
225 may both be installed the same host computer. FIG. 6 
also illustrates that each service agent will be assigned a 
Database Type 620 and a Database Server 625. A list of the 
available Database Types 640 and Database Server Types 
650 are maintained in the Relational Database in the portal 3Q 
system 120. Generally, only one Database Type 620 will be 
associated with a Database Server 625. 

FIGS. 7 and 8 depict some of the properties associated 
with the specific service agents of the portal system 120. 
Each repository 235 is assigned certain properties 700 
including a name, a host, a database type, and a database 15 
server. Each authentication server 220 is also be assigned 
certain properties 705 including a name, a host, a database 
type, a database server, a list of supported capabilities 710, 
and a table of drivers 715. The capabilities 710 define 
whether the authentication server 220 may be used to create 20 
and modify users and groups for the portal system 120. The 
table of drivers 715 defines the authentication drivers which 
may be utilized to authenticate users such as LDAP. 

Each job server 230 is assigned certain properties which 
are illustrated in FIG. 7. As with all other service agents, 25 
each job server 230 is assigned a name, host, database type, 
and database sever. In addition, each job server 230 is also 
assigned an Application 720, a Program 725, a job server 
Class 730, and a SQR Server 735 if the job server will be 
* used for SQL jobs. The Application 720 is typically a 30 
third-party vendor application designed to run in the back- 
ground. Application examples include Brio Technologies 
SQR, Oracle Reports, or public domain application shells 
such as PERL. A Program 725 is typically a source used to 
drive a specific invocation of an application. For example, a 35 
user 100 might submit an SQR program that generates a 
sales report to an SQR application on a given host through 
a job server 230. The Job Server Class 730 property iden- 
tifies what kind of job server is installed and the SQR Server 
735 defines what kind of SQR Server is installed (i.e. SQR 40 
V4.3 for Sun/Solaris/ORACLE). Each of the Application 
720, Program 725, job server 730 and SQR Server 735 
properties will have certain sub-properties assigned as well. 

The search server 245 is assigned certain properties which 
are illustrated in FIG. 8. As with all other service agents, the 45 
search server 245 is assigned a name, host, database type, 
and database server. In addition, each job server 230 is also 
assigned Search Engine Properties 800 and Information 
Sources 805. The Search Engine Properties 800 describe the 
operating parameters for the search engine, including a 50 
Query Port value, an Index Port value, a Language, an Index 
Hyphenated Words flag, and Hyphen Character. The Query 
Port value identifies that port which will handle simulta- 
neous queries from a user. The Index Port value identifies the 
port which receives indexing requests from the crawl server. 55 
The Language specifies the language in which month names 
and abbreviations of month names appear. The Index 
Hyphenated Words flag indicates whether the Search Engine 
indexes a hyphenated word as well as individual words. The 
Information Sources properties 805 describe the names and 60 
properties of each Information Source that has been indexed 
by the crawl server. Some of the properties include a Name, 
a Description, a Number of Documents which may be stored 
in the Information Source, an Expiration Date, and an 
Expiration Action for the Information Source. 65 

Each channel is assigned certain properties which are 
illustrated in FIG. 8. Some of these properties include a set 
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of General Properties 810, Advanced Properties 815, Query 
Properties 820, and Information Sources 825. The General 
Properties 810 define the name of the channel, a brief 
description, the owner and group of the channel, and the 
expiration date of the channel. The Advanced Properties 815 
for each channel include a Browsable flag and Permissions. 
The Browsable flag indicates whether the channel is visible 
to the user 100 using the Browser interface. The Query 
Properties 820 define the query to be provided by the 
channel to the search server. These properties include the 
Query Phrase, the minimum relevancy for the search, the 
document types to be identified, a test to determine if the 
query had been modified in the past, a test to determine if 
any exception items are present, a test to determine if any 
high priority documents are present, and a delete training 
history flag. Also included are settings determining how 
search results should be sorted (i.e. by relevancy or last- 
modified date) and a display document summary flag. The 
Information Sources 825 associated with a channel define 
which information sources which will be searched when a 
particular channel is executed. 

FIG. 9 depicts certain properties that are assigned to each 
crawler in the crawl server 250. These properties include 
General Properties 900, Schedule Properties 905, Limits 
910, Directory Properties 915, Web Pages Properties 920, 
Proxy Properties 925, Authentication Properties 930, Cook- 
ies Properties 935, Protocol Properties 940, Date Format 
Properties 945, Text File Properties 950, and Status Prop- 
erties 955. Each of these sets of properties is described 
below. 

The General Properties 900 define the basic parameters to 
be used by each crawler in conducting its search. Included 
in these parameters are the name of the crawler, the infor- 
mation sources to be searched by the crawler, a list of the 
URLs to crawl, a flag indicating whether the URLs are case 
sensitive, and a flag indicating whether the crawler should 
follow links to other sites. 

The Schedule Properties 905 are used to define the 
schedule on which the crawler operates. Included within the 
Schedule Properties 905 are a schedule (as disclosed in FIG. 
5), a start time, and a repeat interval. 

The Limits 910 are used to prevent the crawler from 
infinitely indexing sites which have deep links. Some of the 
limits which may be defined for a crawler include a maxi- 
mum depth (which specifies how many levels of links a 
crawler may follow), a site duration (which defines the 
number of hours a crawler should spend on any given site), 
a page delay (which defines the number of seconds the 
crawler should wait after downloading a page before moving 
to the next link of set of links), and a page timeout (which 
defines the number of seconds that the crawler will wait to 
receive data after requesting a page). 

The Directory Properties 915 provide an Allowed field 
and a Disallowed field. The allowed field defines a set of 
strings that must exist in a URL object for that object to be 
indexed. The disallowed field defines a set of strings that 
must not be in the URL object in order for the object to be 
indexed. These fields may be used to greatly limit the 
number of sites which are indexed in an information source. 

Web Pages Properties 920 are used to exclude HTML and 
text files from being indexed in an information source based 
upon the content of the document and/or header. Using these 
properties, a user can specify those strings that must exist 
and/or those strings that must not exist. If either of these 
constraints are not met, then the HTML document will not 
be indexed. For example, if a user does not want any HTML 
or text files to be indexed that contain the phrase "living 
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abroad," the term "living abroad" is added as a Cannot Have 

Text string. Or, if a user desires to index only all the HTML TABLE 1 

or text files that contain the phrase "electronic filing," then 

the term "electronic filing" would be added as a Must Have * * car * s 4 digits; 1 ? 9 ^ a 5° > ^ m « 

& , MM - Month as a numeric two digit number; 01, 12 

Text String. HTML Or text files must meet both Constraints 5 SHOHTMONTH - Abbreviation for the month; Jan, Sep 

in order to be indexed in an information source. If a LONGMONTH- Long month format; January, September 

document meets both the Must Have constraint and the DD - Day as a two-digit number; 08, 31 

Can'. Have constraint, then the Can't Have constraint takes £ ; » * ^ ^ * 31 

precedence and the document will not be indexed. H - Hour as either a one- or two-digit number; 1, 10 

Accordingly, the web pages properties will include two 10 nn - Minute as a two-digit number 

fields: a Must Have Text field and a Cannot Have Text field. N+ - ^ nute as either a one " or two-digit number 

Other parameters may also be set to look for these strings in 5? " 5^°^ as a num ^ r . t . 

, , , J *=r S+ - Second as either a one- or two-digit number 

the headers or content only. xzz - Time Zone; GMT, EST, PST 

Proxy Properties 925 are used to define the proxy param- — — — — — — 

eters if the portal is connected to the Internet through a proxy 35 ^ ^ ^ p ^ 95Q what extensions tQ 

server. The Proxy Properties 925 mclude the host address of treat as text mes; for example, TXT, CSV, XML, and so on. 

the proxy server, the port on which the proxy server listens, Entering these extensions in the text file properties screen 

and the user name and password for connecting to the proxy mea ns that if the crawler finds a file with one of these 

server, extensions, it will treat the extension as a text file. MS and 

The Authentication Properties 930 define how the crawler 20 PDF files will automatically be treated as text files, 

is to log into websites that require a login. Three different The Status Properties 955 displays information on how 

types of login authentication methods are available: HTTP the crawler is currently performing. Some of the parameters 

authentication, forms-based authentication, and cookie- identified as status properties include the number of seconds 

based authentication. Sites that display a dialog window lhal the crawler nas been running since the start of the 

requesting a user name and password are generally using 25 current ™ a > number °f P a S es ^at the crawler has 

http authentication. A user name and password usually have downloaded since the start of the current run the number of 

to be provided so that the crawler can log into the website. URLs that retum error 404 ' and the number of P a S es that the 

Sites that have a page containing a login form are usually crawler does not have authorization to access. The Status 

forms-based authentication. These are fairly difficult to Properties 955 aUow an admimstra tor to d^play the status of 

configure because there are many entries that can be defined 30 the last ™ n > *° craN f er oa hold >. and t0 test the 

on the form. Additional login information may need to be crawler " ™ e scheduled crawler state provides the crawler 

specified in the cookie tab. Accordingly, the Authentication ? atls ) tlcs re S ardin g the ] asl time that . crawler executed 

Properties 930 will indicate which authentication protocol b <* td on . lts schedu ^ £ th f crawler » cuncntly executing, 

should be used for a site as well as user names and these statistics wiU be displayed. It does not provide statis- 

passwords necessary for logging in. 35 tics when the crawler was executed interactively. When a 

_ . . , , . , crawler is placed on hold, the crawler will not be allowed to 

Cookies Properties 935 are used to specify additional eX6Cut6 based OQ ^ Ho me hoW wiu nQt 

login information as well as cookies that should be sent to affect me abi% , 0 ^ tfae crawl6f int6ractivel A crawler 

a webserver, rhe additional login information allows the should be , esled ^ creati ft tQ vedf ft behayes ^ 

crawler to log into a site in which a form requires more than m desired fi checki the , m information, and admin- 

just a user name and password. For example if the form has 0f ^ WQ ^ ^ wha( documents &K bei 

an input field that specifies a user s age, then age can be downloaded ^ discarded . when testing a crawler, the 

specified as additional login information. Therefore, the documents ^ nol ^extd into information sources, 

field s name and value should be added to the cookie as ^ e p orta j p aee 

additional login information. Often a form-based login will cir , 1ft A „J~- 1c n ™ MOA „ tQt :„« ^ i„ rtf ^ to i 

, . » . , . ... , . 45 MU. 1U depicts a representative example 01 a portal page 

result in a cookie being submitted to a user s browser 1(m ^ ^ froffl a ^ brQWSer m A d 

interface. The cookie contains information about the user s im fa , cuslomized web mat B dala &om the 

login parameters insuring that the user and only that user see , m ^ ^ mQst tQ a icular ^ 

the conten on that website. lUe cookies properties 935 A rta , im 

is a user's primary interface to the data, 

allows coofoe-based login, or cookie spoofing. Using this 5Q f and jobg ^ are {n ^ { m m 

method a crawler can sorohmes leap frog the login process £ach Qf tfae ^ q{ daU ^ afe ^ lQ ft ^ &t ±c 

aU together and insure tha the crawbr appears logged in as { 1(m ^ ^ & a , object ^ im 

soon as it arrives at the site. Once the names and values of ^ { ^ {Q a ^ WQ {n a format 6 that is 

the cookies have been determined, they may be added to the readable b a standard brow&er am The ^ defau]t 

cookies page of the crawler, thus allowing the crawler to 55 { mo fe ^ ^ fae ^ after { ^ ^ 

logm when it starts. Of course if the cookie expires then (he al m ^ flfSt ^ a usef { in{Q {hQ 

cookie-based authenncation will work for only the duration ^ system 12Q> a poftal page 100Q ^ au f omaticaUy 

that the cookie is valid. generated. Thereafter, a user 100 can modify his respective 

The Protocol Properties 940 allow an administrator to copy of the portal page 1000, and can create additional ones, 

specify the HTTP protocol version as either 1.1 or 1.0. An 60 ij sers can modify the content, layout, and colors of any of 

administrator is also allowed to specify whether secure their portal pages 1000, as well as changing which portal 

sockets layer (SSL) is to be used in retrieving "https://" page iqoo is the default page (the one that displays at the 

U^Ls* beginning of a portal session). Users to whom permission is 

The Date Format Properties 945 define the specific date granted can publish their own portal pages for others to 

format and language to use when displaying search results 65 copy. Users can add additional components to any of their 

with the web client. Some of the date formats which may be portal pages 1000, or remove optional components. In 

utilized are listed below in table 1. addition to using pre-con figured content provided by the 
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administrator, a user can create and include on his portal exception condition has been found in a particular job 

page 1000 other content of interest to him, such as processed in the portal system 120. An exception condition 

bookmarks, channels, and job output files displayed as portal is tied to the results of a job executed within the portal 

objects. system 120. Only certain jobs within the portal system 120 

In FIG. 10, the portal objects are generally arranged in 5 can be configured to indicate an exception condition. In 

three columns. The left-most column 1001 is entitled "Per- addition, a user 100 is required to subscribe to an exception 

sonar and contains a set of links which may be selected by associated with a job before he can be notified of the 

the user with the browser program. These links allow the exception condition via his portal page 1000 The exception 

user to access certain "favorite" and "my" objects which dashboard window 1020 can be configured by a user to 

have been previously defined. The middle column 1002 10 dus P* a 7 f m * C * t0 t r when m exception condition is met by 

. . , c * t , . . . . * . (( „ a particular job that was processed by the portal system 120. 

includes four tabs at the top entitled personalize, i n FIG. 10, the exception dashboard is configured to display 

content, layout, and edit. These tabs are used to a traffic light which is green when no exceptions are present 

customize the appearance of the user s personal portal page. and red when m exception condition was indicated by a job. 

Below these tabs are four portal objects entitled Broadcast Xhe right . most of the portal page 1000 of FIG. 10 

Messages 1005, Company Billboard 1010, My Bookmarks 15 i nc i u des a display window 1025 and another portal object 

1015, and the Exceptions Dashboard 1020. The right-most entitled E-commerce Top Stories 1030. The display window 

column 1003 includes one display window 1025, and 1025 is used to display the objects and reports requested by 

another portal object entitled E-commerce Top Stories 1030. a user 100 during a session. Generally, the requested objects 

Each of these portal objects is described below. must be in a format that is capable of being read by a 

In FIG. 10, the Broadcast Messages portal object 1005 is 20 browser program in order to be displayed in a the display 

configured as a mandatory portal object. Thus, it cannot be window 1025. In FIG. 10, the personal dashboard portal 

modified or removed by a user 100. Accordingly, the Broad- object is being displayed in the display window 1025. 

cast Message portal object in FIG. 10 does not include an The portal object illustrated at the bottom of the right - 

"X" icon 1008 in the upper right hand corner which would most column 1003 of the portal page 1000 is a syndicated 

allow a user to delete this portal object from their portal page 25 content portal object entitled E-commerce Top Stories 1030. 

1000. Beneath the Broadcast Message portal object 1005, is A syndicated content portal object is used to present dynami- 

the Company Billboard portal object 1010. The Company cally updated content provided by a third party. Third party 

Billboard portal object 1010 is a precon figured category content could include information such as a news- wire, a 

which includes a list of links which may be accessed, but not stock quote service, or a sports score reporting service. In 

edited by the user 100. By clicking on one of these links, a 30 FIG. 10, the third party content that is provided in this 

corresponding object will be displayed in the display win- window is a news-wire service related to E-commerce 

dow 1025 on the right-hand side of the portal page 1000. In stories. 

the example of FIG. 10, the Personal Dashboard link in the The personal dashboard is an object that can be person- 
Broadcast Messages category 1005 has been selected by a alized and configured by a user to display a variety of objects 
user; thus causing the Personal Dashboard portal object to be 35 from the portal system 120 as well as content obtained from 
displayed in the display window 1025 of the portal page the Internet. For example, in FIG. 10, the user 100 has 
1000. Because the Company Billboard 1010 has been set up configured his personal dashboard to display two image 
as a preconfigured category, a user 100 cannot edit the links bookmarks: the "Just-In-Time Sales Report" 1035 and the 
which are listed in this window. A user 100 does have the "Operations Dashboard" 1040, a chart entitled "Sales by 
option of removing this portal object from his personalized 40 Product Analysis" 1045, a set of current weather information 
portal page by selecting the "X" icon 1008 in the upper right 1050, and a continuously updating banner describing other 
hand corner of the Company Billboard portal object. information 1060. The two image bookmarks are links to 

Below the Company Billboard 1010 is the My Bookmarks other objects which may be displayed in the display window 

portal object 1015. The My Bookmarks portal object 1015 is 1025. Thus, by clicking on either of these icons, a corre- 

set up as a standard component which means that a user 100 45 sponding object would then be displayed in the display 

has the ability to modify this portal object to include the window 1025. The "Sales by Product Analysis" chart 1045 

content that he desires. Within the My Bookmarks object are is a dynamically updated portal object that displays the 

links to other objects within the repository 235 or to other results of a job that was recently executed by the portal, 

sites on the Internet which the user 100 deems to be relevant. Every time that the corresponding job is executed, an output 

New bookmarks may be added to this category by the user 50 file would be generated which would then be displayed on 

100 by pressing the "New Bookmark" link at the bottom of the user's personal dashboard. Thus, by merely displaying 

the My Bookmarks portal object 1010. In addition, a user the personal dashboard, a user may view an image of the 

100 can edit the bookmarks residing in this portal object by most recent sales by product analysis, or any other job 

pressing the "Edit" button located at the top right-hand report. This portal object may also be con figured as an image 

corner of the My Bookmarks portal object 1010. Pressing 55 bookmark so that a larger image of the graph will be 

the "Edit" button in the My Bookmarks portal object will displayed by clicking through. The set of current weather 

cause an interactive form to be displayed in the display information 1050 indicates that the personal dashboard may 

window 1025, thus allowing the user 100 to edit the content be configured to display information from the Internet as 

of the portal object. A user can also delete the My Book- well as objects from the portal. Lastly, the continuously 

marks window from the personalized portal page 1000 by 60 updating banner describing other information 1060 illus- 

selecting the "X" icon 1008 in the upper right hand corner trates that any data may be configured to be displayed on the 

of window, personal dashboard as long it is readable by a standard web 

The last window displayed in the middle column of the browser. Thus, the personal dashboard can display a variety 

portal page 1000 is the Exceptions Dashboard 1020. The of structured and unstructured data in a standardized format, 

exceptions dashboard 1020 is set up as a standard compo- 65 The Portal Processes 

nent which may be edited and configured by a user. The The login process for a user can be described by refer- 

exception dashboard 1020 is used to indicate when an encing FIG. 2. A user 100 can be connected to the portal 
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system 120 by accessing a computer which is connected to asynchronously so that the service broker 125 is freed up to 

the portal's network server 105. Initially, the user 100 sends perform other tasks while the job is executing. It should be 

a request to the network server 105 for access to the portal noted that a job will often include several objects necessary 

system 120. The user 100 is then prompted to provide a for the execution of the program other than the executable 

username and password for the portal system 120. This 5 file. These objects may include metadata corresponding to 

information is then passed firom the web client 115, to the the job, forms objects, and INPUT objects. The job server 

service broker 125 to the authentication server 220. At the 230 uses all of these objects to execute the job request, 

authentication server, if the user 100 is identified as a valid A job may require a fresh set of data to be retrieved from 

user, then a session token is sent to the service broker 125 a back-end database 200, 205, or 210. If this is the situation, 

which grants the user 100 access to the portal system 120 for 10 then the job will be dispatched to a job server 230 that is 

a period of time. If no activity is detected by the service connected to an appropriate back-end database. After the 

broker 125 for a certain period of time, then the user's data is retrieved from a back-end database, it is processed by 

session is closed and he will be forced to log in again. the job server 230 and an output report is prepared. In many 

Information about the user's group affiliation and system instances, the output report will have to be transformed into 

permissions are also stored in the authentication server 220. 15 a format that is appropriate for storage and/or presentation to 

Thus, the authentication server 220 determines what level of the user. This process is performed by the job server 230. 

access to be given a user 100 based upon his permissions. After the output reports have been converted into the appro- 

For example, a user 100 may only be given permission to priate format, then the job server analyzes the job metadata 

view certain categories or objects in the repository 235. All to determine if there are any subscriptions or notifications 

other categories and objects in the repository are off-limits 20 which need to be fulfilled. The job server 230 will also test 

to the user. The service broker 215 would therefore bar the the output report to determine if any exception conditions 

user from accessing any "off-limits" objects and categories are present. The job server 230 utilizes the job metadata to 

and would only allow him to access permitted areas within determine if any of the exception conditions have been met. 

the portal system 120. If so, appropriate notifications will be sent to the subscribing 

After a user 100 logs in to the system, he will first be 25 users via e-mail or portal object update (such as notification 

presented with his personalized portal page 1000 on his on a dynamic portal object). In addition, the job server 230 

browser software. The process by which this is accom- will assign user and group permissions to the output report, 

plished is described below. After a session is established at These permissions define the users and groups that can 

the service broker 125 of the portal system 120, the service access and view the output report. The output report will 

broker retrieves a set of metadata corresponding to the user's 30 then be transmitted to the service broker 125 so that it can 

personal portal page 1000 from the repository. This metadata be forwarded to the user who requested it. The output report 

indicates which portal objects should also be retrieved from will generally be displayed in the display window 1025 of 

the repository 235 in order to populate the portal page 1000. the user's personalized portal page. A copy of the output 

After all of the portal objects have been retrieved from the report may also be stored in the repository 235. 

repository, they are assembled into a format which can be 35 Another condition that may be encountered by the job 

read by a browser program. After this, the personalized server 230 during the execution of a job is the requirement 

portal page 1000 is transmitted to the appropriate user 100 of input for the job. In some instances, the input can be 

through the network server 105. At the user's interface, his provided by an INPUT object associated with the job in the 

browser program will display the personalized portal page repository (see item 420 of FIG. 4). In other instances, the 

1000 to the user. The process of assembling a personalized 40 input must be provided by the user 100 as the job executes, 

portal page 1000 is repeated every time a user 100 refreshes In this situation, an ASK form associated with the job will 

his browser screen. Because each portal page 1000 is be used to solicit input from the user 100. When a job server 

assembled on an ad-hoc basis using the most current ver- 230 requires input, an ASK form will be transmitted to the 

sions of the portal objects in the repository 235, the data service broker 125 by the job server 230. The ASK form will 

presented on a newly refreshed portal page 1000 will present 45 then be presented to the user 100 at the user's browser 

the most recent data available in the portal system 1000. interface. If a job requires input from a user, and there is no 

A user 100 may select jobs to be processed by the portal corresponding ASK form, then the portal system 120 may 

system 120 by performing the following steps. First, a user construct an ad-hoc input form which can be presented to the 

will select a link or object on his personalized portal page user. After the user 100 has made his input selections on the 

1000 which corresponds to a job. For example, in FIG. 10, 50 form, the input data is transmitted to the job server 230 

a user 100 may select the first link in the company billboard through the service broker 125. The input data is then 

portal object 1010. Or, a user 100 may select the "Just-in- utilized by the job server 230 to complete the execution of 

Time Sales Report" image bookmark 1035 in the personal the job. 

dashboard. Either of these selections will transmit a request Another feature of the portal system is the creation of 

to the portal system 120 to execute a job and return the 55 secure bursted output reports. These output reports contain 

results to the user 100. Referring again to FIG. 2, when the embedded permission markings which restrict the ability of 

request to execute a job is received by the service broker some users and groups to view some portions of an output 

125, it is first determined if the user has permissions to report. Thus, when a user with limited permissions views a 

execute this job. This is done by polling the authentication secure bursted output report, he will see only those sections 

server 220 to create a user context for the job. The user 60 which he has been given permission to see. The secure 

context will include the user's username, group information, bursted output report feature works best when the output 

default category, and default permissions. If the user 100 is report has been bursted into multiple files with each file 

found to have appropriate permissions for the job, then the containing one or more HTML pages. When an output report 

job will be retrieved from the repository 235 and sent to the is created in the job server, each file is tagged with a set of 

service broker 125. The service broker will then dispatch the 65 user and group permissions defining the ability of certain 

job (along with the user context) to an appropriate job server users and groups to access all of the HTML pages in the file. 

230 for processing. Generally, the job will be dispatched A master file containing a plurality of containers is also 



03/01/2004, EAST version: 1.4.1 



US 6,643,661 B2 

25 26 

generated. Each container is used as a reference to one of the the appropriate job server 230 for execution. The event 

bursted files. Thus, by assembling all of the bursted files into server 235 will make it appear that the user 100 that 

the master file, a complete output report may be generated. scheduled the job is the user 100 that is running the job 

Accordingly, an output report with multiple containers, each through the scheduler. Third, the event server 235 sets the 

of which having different levels of permissions, yields the 5 parameter values for the job with the parameter values that 

most flexibility. were input by the user 100 when the job was scheduled. This 

To retrieve a secure bursted output report from the would include both ASK and INPUT values for SQR jobs 

repository, the user must first have logged into the portal and input values for generic jobs. Fourth, the event server 

system 120. The process of logging into the portal estab- 235 sends the job to the service broker 125. The service 

fishes a user context which contains, among other things, the 10 broker 125 will route the job to an appropriate job server 230 

useraame, user permissions and group information. After a for execution. The command that is used to send the job to 

user is logged in, the portal system 120 uses the following the job server 230, via the service broker 125, is a command 

steps to determine which parts of a bursted secure job output that will cause the job to run asynchronously. The command 

to provide to a user. First, when a user requests a secure also contains distribution information that allows the job 

bursted output report, the normal permission checking is 15 server 230 to distribute output reports and send e-mail 

done to determine if the user has the appropriate permissions notifications when the job is completed. When a job runs 

to read the master output report. If the user does not have asynchronously in the job server 230, a unique thread is 

appropriate permissions, then an error message is returned to created to run the job and control returns to the event server 

the user and the retrieval process is ended. Second, the 215. This mechanism allows the event server 235 to send 

repository checks the permissions for each container in the 20 numerous scheduled jobs to the service broker 125 for 

master output report. If the user has appropriate permissions execution without waiting for previous jobs to finish. The 

for the container, then the special tags that were assigned to event server 235 supports a user configurable concept called 

each file when the output was created are checked. If the user retry processing. If the event server 235 fails to send the job 

name is in the list in the tags or the user is a member of one to the job server 230, one of two things happens. If retry 

of the groups listed in the tags, then this container is added 25 processing has been disabled, an error notification will be 

to the set of containers to be returned as part of the master sent to the e-mail address specified when the job was 

output report. If the user does not have appropriate scheduled. No attempt will be made rerun the job. If retry 

permissions, then the container is not added to the set of processing is enabled, then an e-mail will be sent informing 

containers to return. No errors are returned from this step. the user that the job could not be sent to the job server 230 

The fact that a user cannot see a particular container is not 30 and it is being resubmitted. The job is then added to a retry 

necessarily an error but a function of the way the secure queue so the event server 235 can try to run it later. Fifth, 

bursted output program operates. After this process is fin- when control is returned to the event server 235, the meta- 

ished and a list of viewable containers in the secure bursted data for the scheduled job is updated to reflect the time it was 

output report is determined, the contents of each approved run and the number of times it has run is incremented, 

container are retrieved from the repository and are presented 35 Jobs that are placed in the retry queue are retried using the 

to the user. following algorithm. Each job will be retried until they run 

In the event server 215, jobs may be associated with successfully or until the event server 235 is stopped. When 

certain time events. These time events control when jobs a job is retried, it follows the five steps outlined above, 

run. Jobs which can be associated with time events include When a job is first placed in the retry queue it is set to run 

batch jobs that are to be processed in the job servers, 40 10 minutes after the time it is placed in the queue. After each 

searches to be conducted in the knowledge server, or retriev- attempt to run the job, if it fails, it is resubmitted to the retry 

ing channels from the repository for processing in the queue with an additional 10-minute wait time added until the 

knowledge server. When the event server 215 starts, it builds next retry. In other words, jobs submitted to the retry queue 

an ordered list of time events that are known to the system. will be retried first in 10 minutes, then 20 minutes, then 30 

The time events are ordered by the next time that they are 45 minutes, etc. all the way up to 120 minutes. After the retry 

scheduled to run. Each time event may have one or more period hits 120 minutes it stays there. Thus, if a job that is 

jobs scheduled against it. The job information includes data retried after 120 minutes and fails, it will be retried in 

about the job properties and distribution/notification infor- another 120 minutes. The 120-minute maximum delay is 

mation. The data necessary to build this list is retrieved from user configurable. 

the database that contains the event server 215 metadata. 50 When the event server 235 is first started, a sweeper 

Once the ordered list is generated, the event server 215 thread is started that looks through the metadata about time 

calculates the amount of time until the first time event is events and scheduled jobs. The sweeper thread looks for 

scheduled to run and goes to sleep for this amount of time. scheduled jobs that should have run in the past but didn't. 

The following steps are performed when the event server This would include jobs that were queued up or in the retry 

215 awakens and collects the set of jobs associated with the 55 queue when the event server 235 was stopped. When such a 

time event scheduled to run at that time. These actions are job is found it is retried using the five steps described above, 

performed for each scheduled job. First, the event server 215 Although the portal system has been described in this 

sends a transaction to the repository 235 to retrieve a job that specification by way of description, examples and drawings, 

is to be run. If the job is no longer in the repository 235, then it should be noted that various changes and modifications 

the event server 235 will remove the scheduled job from the 60 may be apparent to those skilled in the art. In particular, 

event server 235 metadata and move on to the next sched- many modifications in the structure and operation of the 

uled job. Second, the event server 235 sends a transaction to hardware and software may be made in the implementation 

the authentication server 220 to get information about the of the embodiments described above. Any such changes and 

user 100 that scheduled the job. That information includes modifications should be construed as being included within 

the useraame, group information, default folder and default 65 the scope of the inventors' original conception, unless they 

permissions. This information is used to create a user depart from the scope of the invention as defined by the 

context. This user context is used when sending the job to claims. 
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What is claimed is: 

1. A computer system configured to search for structured 
and unstructured data objects, the computer system con- 
nected to a network interface that may be connected to at 
least one user through a computer network; the computer 5 
system comprising: 

a service broker electrically connected to the network 
interface, the service broker adapted to control a level 
of access to the computer system by a user; 

a repository electrically connected to the service broker, 10 
the repository comprising a computer memory encoded 
with a plurality of objects including structured data 
objects and unstructured data objects; 

a knowledge server electrically connected to the service 15 
broker and to the repository, the knowledge server 
comprising: i) a computer memory encoded with at 
least one information source comprising a searchable 
index describing at least one set of structured and 
unstructured data objects; ii) a crawl server configured 2Q 
to manage at least one crawler, wherein said at least one 
crawler is configured to process a set of structured and 
unstructured data objects and generate a searchable 
index describing the set of objects, and wherein said at 
least one crawler is configured to store the searchable ^ 
index in a corresponding information source; and iii) a 
search server configured to process a search request so 
as to generate a search result, wherein the search 
request comprises at least one query term, a corre- 
sponding set of search properties, and a list identifying 3Q 
at least one information source to be searched, and 
wherein the search result comprises a list of objects 
identified during the search. 

2. A computer system according to claim 1, wherein at 
least one user is electrically connected to the network 35 
interface through a computer network. 

3. A computer system according to claim 2, wherein the 
knowledge server is configured to receive a search request 
from said at least one user and to provide a corresponding 
search result to said at least one user. 

40 

4. A computer system according to claim 1, wherein the 
set of structured and unstructured data objects includes data 
objects outside the computer system. 

5. A computer system according to claim 4, wherein the 
set of structured and unstructured data objects includes data 45 
objects on the Internet. 

6. A computer system according to claim 1, wherein the 
knowledge server further comprises: 

a computer memory encoded with a first information 
source comprising a searchable index corresponding to 50 
a plurality of sets of structured and unstructured data 
objects; 

a crawl server configured to manage a plurality of 
crawlers, wherein each of said crawlers is configured to 
process a unique set of structured and unstructured data 55 
objects and generate a searchable index describing the 
unique set of objects, and wherein each of said crawlers 
is configured to store the generated searchable index in 
the first information source. 

7. A computer system configured to search for structured 60 
and unstructured data objects, the computer system con- 
nected to a network interface that may be connected to at 
least one user through a computer network; the computer 
system comprising: 

a service broker electrically connected to the network 65 
interface, the service broker adapted to control a level 
of access to the computer system by a user; 
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a repository electrically connected to the service broker, 
the repository comprising a computer memory encoded 
with a plurality of objects including structured data 
objects and unstructured data objects; 

a knowledge server electrically connected to the service 
broker and to the repository, the knowledge server 
comprising: i) a computer memory encoded with at 
least one information source comprising a searchable 
index describing at least one set of structured and 
unstructured data objects; ii) a crawl server configured 
to manage at least one crawler, wherein said at least one 
crawler is configured to process a set of structured and 
unstructured data objects and generate a searchable 
index describing the set of objects, and wherein said at 
least one crawler is configured to store the searchable 
index in a corresponding information source; and iii) a 
search server configured to process a search request so 
as to generate a search result, wherein the search 
request comprises at least one query term, a corre- 
sponding set of search properties, and a list identifying 
at least one information source to be searched, and 
wherein the search result comprises a list of objects 
identified during the search; 

wherein the set of search properties includes a relevancy 
property defining a minimum relevancy score that an 
object must have to be included in the corresponding 
search result, a document type property defining a 
document type that an object must be to be included in 
the corresponding search result, and a date modified 
property defining a date beyond which an object must 
have been modified to be included in the corresponding 
search result; 

wherein the computer memory of the knowledge server is 
further encoded with at least one set of crawler prop- 
erties corresponding to one of said at least one crawlers, 
each set of crawler properties including: 
a name of a corresponding crawler; 
a list of URLs to be processed by the corresponding 
crawler; 

a set of limits defining a number of links to follow 
beyond a starting URL; 

a first set of constraints defining terms which must be 
included in an object in order for it to be indexed by 
the corresponding crawler; 

a second set of constraints defining terms which must 
not be included in an object in order for it to be 
indexed by the corresponding crawler; 

a set of authentication properties providing access to 
secure objects in said set of structured and unstruc- 
tured data objects by the corresponding crawler; and 

a schedule defining a time when the corresponding 
crawler is to begin execution. 

8. A computer system according to claim 7, wherein at 
least one user is electrically connected to the network 
interface through a computer network. 

9. A computer system according to claim 8, wherein the 
knowledge server is configured to receive a search request 
from said at least one user and to provide a corresponding 
search result to said at least one user. 

10. A computer system according to claim 8, wherein the 
set of structured and unstructured data objects includes data 
objects outside the computer system. 

11. A computer system according to claim 10, wherein the 
set of structured and unstructured data objects includes data 
objects on the Internet. 

12. A computer system according to claim 10, wherein the 
knowledge server further comprises: 
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a computer memory encoded with a first information 
source comprising a searchable index corresponding to 
a plurality of sets of structured and unstructured data 
objects; 

a crawl server configured to manage a plurality of 5 
crawlers, wherein each of said crawlers is configured to 
process a unique set of structured and unstructured data 
objects and generate a searchable index describing the 
unique set of objects, and wherein each of said crawlers 
is configured to store the generated searchable index in 10 
the first information source. 

13. A computer system configured to search for structured 
and unstructured data objects, the computer system con- 
nected to a network interface that may be connected to at 
least one user through a computer network; the computer 15 
system comprising: 

a service broker electrically connected to the network 
interface, the service broker adapted to control a level 
of access to the computer system by a user; 

a repository electrically connected to the service broker, 
the repository comprising a computer memory encoded 
with a plurality of objects including structured data 
objects, unstructured data objects, and at least one 
channel, wherein the channel comprises at least one 25 
query term, a corresponding set of search properties, 
and a list identifying at least one information source to 
be searched; 

a knowledge server electrically connected to the service 
broker and to the repository, the knowledge server 30 
comprising: i) a computer memory encoded with at 
least one information source comprising a searchable 
index describing at least one set of structured and 
unstructured data objects; ii) a crawl server configured 
to manage at least one crawler, wherein said at least one 35 
crawler is configured to process a set of structured and 
unstructured data objects, generate a searchable index 
describing the set of objects, and store the searchable 
index in a corresponding information source; and iii) a 
search server configured to receive said at least one 40 
channel from the repository and process the channel 
with each of the information sources fisted in the 
channel so as to generate a search result, wherein the 
search result comprises a list of objects identified 
during the search. 45 

14. A computer system according to claim 13, wherein at 
least one user is electrically connected to the network 
interface through a computer network. 

15. Acomputer system according to claim 14, wherein the 
computer system is configured to receive a request to 50 
process a channel from said at least one user and to provide 

a corresponding search result to said at least one user. 

16. Acomputer system according to claim 13, wherein the 
set of structured and unstructured data objects includes data 
objects outside the computer system. 55 

17. Acomputer system according to claim 16, wherein the 
set of structured and unstructured data objects includes data 
objects on the Internet. 

18. Acomputer system according to claim 13, wherein the 
knowledge server further comprises: 60 

a computer memory encoded with a first information 
source comprising a searchable index corresponding to 
a plurality of sets of structured and unstructured data 
objects; 

a crawl server configured to manage a plurality of 65 
crawlers, wherein each of said crawlers is configured to 
process a unique set of structured and unstructured data 



objects and generate a searchable index describing the 
unique set of objects, and wherein each of said crawlers 
is configured to store the generated searchable index in 
the first information source. 
19. Acomputer system configured to search for structured 
and unstructured data objects, the computer system con- 
nected to a network interface that may be connected to at 
least one user through a computer network; the computer 
system comprising: 

a service broker electrically connected to the network 
interface, the service broker adapted to control a level 
of access to the computer system by a user; 
a repository electrically connected to the service broker, 
the repository comprising a computer memory encoded 
with a plurality of objects including structured data 
objects, unstructured data objects, and at least one 
channel, wherein the channel comprises at least one 
query term, a corresponding set of search properties, 
and a list identifying at least one information source to 
be searched; 

a knowledge server electrically connected to the service 
broker and to the repository, the knowledge server 
comprising: i) a computer memory encoded with at 
least one information source comprising a searchable 
index corresponding to at least one set of structured and 
unstructured data objects; ii) a crawl server configured 
to manage at least one crawler, wherein said at least one 
crawler is configured to process a set of structured and 
unstructured data objects, generate a searchable index 
describing the set of objects, and store the generated 
searchable index in a corresponding information 
source; and iii) a search server configured to receive 
said at least one channel from the repository and 
process the channel with each of the information 
sources listed in the channel so as to generate a search 
result, wherein the search result comprises a list of 
objects identified during the search; 

wherein the set of search properties includes a relevancy 
property defining a minimum, relevancy score that an 
object must have to be included in the corresponding 
search result, a document type property defining a 
document type that an object must be to be included in 
the corresponding search result, and a date modified 
property defining a date beyond which an object must 
have been modified to be included in the corresponding 
search result; 

wherein the computer memory of the knowledge server is 
further encoded with at least one set of crawler prop- 
erties corresponding to one of said at least one crawlers, 
each set of crawler properties including: 
a name of a corresponding crawler; 
a list of URLs to be processed by the corresponding 
crawler; 

a set of limits defining a number of links to follow 
beyond a starting URL; 

a first set of constraints defining terms which must be 
included in an object in order for it to be indexed by 
the corresponding crawler; 

a second set of constraints defining terms which must 
not be included in an object in order for it to be 
indexed by the corresponding crawler; 

a set of authentication properties providing access to 
secure objects in said set of structured and unstruc- 
tured data objects by the corresponding crawler; and 

a schedule defining a time when the corresponding 
crawler is to begin execution. 
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20. A computer system according to claim 19, wherein at 
least one user is electrically connected to the network 
interface through a computer network. 

21. A computer system according to claim 20, wherein the 
computer system is configured to receive a request to 5 
process a channel from said at least one user and to provide 

a corresponding search result to said at least one user. 

22. A computer system according to claim 19, wherein the 
set of structured and unstructured data objects includes data 
objects outside the computer system. 10 

23. A computer system according to claim 22, wherein the 
set of structured and unstructured data objects includes data 
objects on the Internet. 

24. A computer system according to claim 19, wherein the 
knowledge server further comprises: is 

a computer memory encoded with a first information 
source comprising a searchable index corresponding to 
a plurality of sets of structured and unstructured data 
objects; 

a crawl server configured to manage a plurality of 20 
crawlers, wherein each of said crawlers is configured to 
process a unique set of structured and unstructured data 
objects and generate a searchable index describing the 
unique set of objects, and wherein each of said crawlers 
is configured to store the generated searchable index in 25 
the first information source. 

25. A computer system according to claim 19, further 
comprising: 

an event server electrically connected to the service 3Q 
broker, the event server comprising a computer 
memory encoded with instructions for dispatching a 
channel for processing in the knowledge server accord- 
ing to a predefined schedule. 

26. A method of conducting a search in a computer system 35 
comprising a service broker, a repository, and a knowledge 
server, wherein the knowledge server comprises a computer 
memory including at least one information source, a crawl 
server configured to manage at least one crawler, and a 
search server, the method comprising: 4Q 

associating said at least one crawler with a set of struc- 
tured and unstructured data; 

assigning a set of crawler properties to said at least one 
crawler, wherein the set of crawler properties defines 
how said at least one crawler will process data objects; 45 

executing the crawler with said set of crawler properties 
and said set of structured and unstructured data so as to 
generate index data corresponding to said set of struc- 
tured and unstructured data; 

storing the generated index data in a corresponding in for- 50 
mation source in the computer memory of the knowl- 
edge server; 

receiving a search request comprising at least one query 
term, a corresponding set of search properties, and a list 
identifying at least one information source to be 55 
searched; and 

processing the search request with each of the listed 
information sources so as to generate a search result 
comprising a list of objects identified during the search. 6Q 

27. A method according to claim 26, further comprising: 
receiving a request for access to the computer system by 

a user connected to the network interface through a 

computer network; 
providing a level of access to the user based upon infer- 

mation stored in an authentication server; 
receiving a search request from the user; 
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providing the corresponding search result to the user in a 
format readable by a browser program. 

28. A method according to claim 26, wherein the set of 
structured and unstructured data objects includes at least one 
data object that is outside the computer system, wherein the 
step of executing the crawler further comprises: 

retrieving a data object from outside the computer system 
through the network interface. 

29. A method according to claim 28, wherein the set of 
structured and unstructured data objects includes at least one 
data object that is on the Internet, wherein the step of 
executing the crawler further comprises: 

retrieving a data object from the Internet through the 
network interface. 

30. A method according to claim 26, wherein the crawler 
properties includes a schedule defining a time when a 
corresponding crawler is to begin execution, the method 
further comprising: 

executing said at least one crawler at a time defined by the 
schedule in the corresponding crawler properties. 

31. A method of conducting a search in a computer system 
comprising a service broker, a repository, and a knowledge 
server, 

wherein the knowledge server comprises a computer 
memory including at least one information source, a 
crawl server configured to manage at least one crawler, 
and a search server, 

wherein the repository comprises a computer memory 
encoded with a plurality of objects including structured 
data objects, unstructured data objects, and at least one 
channel, wherein said at least one channel comprises at 
least one query term and a list identifying at least one 
information source to be searched, the method com- 
prising: 

associating said at least one crawler with a set of 

structured and unstructured data; 
assigning a set of crawler properties to said at least one 

crawler, wherein the set of crawler properties defines 

how said at least one crawler will process data 

objects; 

executing the crawler with said set of crawler proper- 
ties and said set of structured and unstructured data 
so as to generate index data corresponding to said set 
of structured and unstructured data; 

storing the generated index data in a corresponding 
information source in the computer memory of the 
knowledge server; 

retrieving a channel from the repository; and 

processing said at least one query term on the retrieved 
channel with each of the information sources listed 
in the channel so as to generate a search result 
comprising a list of objects identified during the 
search. 

32. A method according to claim 31, further comprising: 
receiving a request for access to the computer system by 

a user connected to the network interface through a 
computer network; 

providing a level of access to the user based upon infor- 
mation stored in an authentication server; 

receiving a request to execute a channel from the user; 

providing the corresponding search result to the user in a 
format readable by a browser program. 

33. A method according to claim 31, wherein the set of 
structured and unstructured data objects includes at least one 
data object that is outside the computer system, wherein the 
step of executing the crawler further comprises: 
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retrieving a data object from outside the computer system 
through the network interface. 

34. A method according to claim 33, wherein the set of 
structured and unstructured data objects includes at least one 
data object that is on the Internet, wherein the step of 5 
executing the crawler further comprises: 

retrieving a data object from the Internet through the 
network interface. 

35. A method according to claim 31, wherein the crawler 
properties include a schedule defining a time when a corre- 10 
sponding crawler is to begin execution, the method further 
comprising: 

executing said at least one crawler at a time defined by the 
schedule in the corresponding crawler properties, 

36. A method of conducting a search in a computer system 15 
comprising a service broker, a repository, and a knowledge 
server, wherein the knowledge server comprises a computer 
memory including at least one information source, a crawl 
server configured to manage at least one crawler, and a 
search server, the method comprising: 20 

associating said at least one crawler with a set of struc- 
tured and unstructured data; 

assigning a set of crawler properties to said at least one 
crawler, wherein the set of crawler properties includes 2 5 
a name of a corresponding crawler, a list of URLs to be 
processed by the corresponding crawler, and a set of 
limits defining a number of links to follow beyond a 
starting URL; 

for each URL listed in the crawler properties, executing 30 
said at least one crawler to perform the following: 

i) retrieving a data object corresponding to the URL; 

ii) generating index data corresponding to the retrieved 
data object, the index data including a name of the 
retrieved data object, a document type corresponding 35 
to the retrieved data object, a date field correspond- 
ing to the last date on which the retrieved data object 
was modified, and a set of keywords associated with 
retrieved data object; 

iii) storing the index data in a corresponding informa- 40 
tion source in the computer memory of the knowl- 
edge server; 

iv) determining a number of links which have been 
followed since retrieving the data object correspond- 
ing to the listed URL; 45 

v) if the number of links which have been followed is 
less than the number of links defined in the set of 
limits in the crawler properties, then performing 
steps a) through b); 

a) processing the retrieved data object to generate a 50 
list of hyperlinks that appear in the retrieved data 
object; 

b) for each hyperlink in the retrieved data object, 
performing the following steps aa) through dd): 

aa) retrieving a data object corresponding to the 55 
hyperlink; 

bb) generating index data corresponding to the 
retrieved data object, the index data including 
a name of the retrieved data object, a document 
type corresponding to the retrieved data object, 60 
a date field corresponding to the last date on 
which the retrieved data object was modified, 
and a set of keywords associated with retrieved 
data object; 

cc) storing the generated index data in a corre- 65 
sponding information source in the computer 
memory of the knowledge server; 
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dd) recursively performing steps iv) through v) 
for the retrieved data object corresponding to 
the hyperlink; 

receiving a search request comprising at least one query 
term, a set of search properties, and a list identifying at 
least one information source to be searched, wherein 
the set of search properties includes a relevancy 
property, a document type property, a date modified 
property, and a sorting criteria property; 

for each entry in each of the information sources listed in 
the search request, executing the search server to per- 
form the following steps: 

vi) comparing the entry in the information source with 
said at least one query term so as to generate a 
relevancy score; 

vii) if the generated relevancy score is greater than the 
relevancy property of the search request, then per- 
forming the following step c): 

c) if the document type property of the entry matches 
the document type property of the search request, 
then performing the following step ee): 
ee) if the date field of the entry is more recent 
than the date modified property of the search 
request, then storing the index data corre- 
sponding to the entry in a corresponding search 
result; 

arranging the entries in the corresponding search result 
according to the sorting criteria property of the search 
request. 

37. A method according to claim 36, wherein the set of 
crawler properties further includes: a first set of constraints 
defining terms which must be included in an object in order 
for it to be indexed by the corresponding crawler; a second 
set of constraints defining terms which must not be included 
in an object in order for it to be indexed by the corresponding 
crawler; 

the step of generating index data corresponding to the 
retrieved data object further comprising: 
aaa) comparing the retrieved data object to the first set 
of constraints and to the second set of constraints in 
the crawler properties corresponding to the crawler; 
and 

bbb) if the retrieved data object complies with both of 
the first set of constraints and the second set of 
constraints, then generating index data correspond- 
ing to the retrieved data object, the index data 
including a name of the retrieved data object, a 
document type corresponding to the retrieved data 
object, a date field corresponding to the last date on 
which the retrieved data object was modified, and a 
set of keywords associated with retrieved data 
object. 

38. A method according to claim 36, wherein the set of 
crawler properties further includes a set of authentication 
properties providing access to secure data objects in said set 
of structured and unstructured data objects by the corre- 
sponding crawler, the step of retrieving a data object corre- 
sponding to the URL further comprising: 

ccc) transmitting a request to retrieve a secure data object 
corresponding to the URL; 

ddd) receiving a request for authentication data from a 
server corresponding to the URL; 

eee) transmitting a set of authentication properties corre- 
sponding to the crawler to the server corresponding to 
the URL; and 

fff) retrieving a secure data object from the server corre- 
sponding to the URL. 
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39. A method according to claim 36, wherein the set of 
crawler properties further includes a schedule defining a 
time when the corresponding crawler is to begin execution, 
the step of executing said at least one crawler further 
comprising: 5 

executing said at least one crawler at a time defined by the 
schedule in the crawler properties. 

40. A method according to claim 36, further comprising: 
receiving a request for access to the computer system by 

a user connected to the network interface through a 30 
computer network; 

providing a level of access to the user based upon infor- 
mation stored in an authentication server; 

receiving a search request from the user; 

providing the corresponding search result to the user in a 35 
format readable by a browser program. 

41. A method according to claim 36, wherein the set of 
structured and unstructured data objects includes data 
objects that are outside the computer system, wherein the 
step of retrieving a data object further comprises: 2Q 

retrieving a data object from outside the computer system 
through the network interface. 

42. A method according to claim 41, wherein the set of 
structured and unstructured data objects includes data 
objects that are on the Internet, wherein the step of retrieving 25 
a data object further comprises: 

retrieving a data object from the Internet through the 
network interface. 

43. A method of conducting a search in a computer system 
comprising a service broker, a repository, and a knowledge 30 
server, 

wherein the knowledge server comprises a computer 
memory encoded with at least one information source, a 
crawl server configured to manage at least one crawler, 
and a search server, 35 

wherein the repository comprises a computer memory 
encoded with a plurality of objects including structured 
data objects, unstructured data objects, and at least one 
channel, wherein said at least one channel comprises at 
least one query term, a corresponding set of search 40 
properties, and a list identifying at least one information 
source to be searched, and 

wherein the corresponding set of search properties includes 
a relevancy property, a document type property, a date 
modified property, and a sorting criteria property, the 45 
method comprising: 

associating said at least one crawler with a set of struc- 
tured and unstructured data; 

assigning a set of crawler properties to said at least one 
crawler, wherein the set of crawler properties includes so 
a name of a corresponding crawler, a list of URLs to be 
processed by the corresponding crawler, and a set of 
limits defining a number of links to follow beyond a 
starting URL; 

for each URL listed in the crawler properties, executing 55 
said at least one crawler to perform the following: 

i) retrieving a data object corresponding to the URL; 

ii) generating index data corresponding to the retrieved 
data object, the index data including a name of the 
retrieved data object, a document type corresponding 60 
to the retrieved data object, a date field correspond- 
ing to the last date on which the retrieved data object 
was modified, and a set of keywords associated with 
retrieved data object; 

iii) storing the index data in a corresponding informa- 65 
tion source in the computer memory of the knowl- 
edge server; 



iv) determining a number of links which have been 
followed since retrieving the data object correspond- 
ing to the listed URL; 

v) if the number of links which have been followed is 
less than the number of links defined in the set of 
limits in the crawler properties, then performing 
steps a) through b); 

a) processing the retrieved data object to generate a 
list of hyperlinks that appear in the retrieved data 
object; 

b) for each hyperlink in the retrieved data object, 
performing the following steps aa) through dd): 

aa) retrieving a data object corresponding to the 
hyperlink; 

bb) generating index data corresponding to the 
retrieved data object, the index data including 
a name of the retrieved data object, a document 
type corresponding to the retrieved data object, 
a date field corresponding to the last date on 
which the retrieved data object was modified, 
and a set of keywords associated with retrieved 
data object; 

cc) storing the generated index data in a corre- 
sponding information source in the computer 
memory of the knowledge server; 

dd) recursively performing steps iv) through v) 
for the retrieved data object corresponding to 
the hyperlink; 

retrieving said at least one channel from the computer 

memory of the repository; 
for each entry in each of the information sources listed in 

the channel, executing the search server to perform the 

following steps: 

vi) comparing the entry in the information source with 
said at least one query term so as to generate a 
relevancy score; 

vii) if the generated relevancy score is greater than the 
relevancy property of the channel, then performing 
the following step c): 

c) if the document type property of the entry matches 
the document type property of the channel, then 
performing the following step ee): 

ee) if the date field of the entry is more recent 
than the date modified property of the channel, 
then storing the index data corresponding to 
the entry in a corresponding search result; 
arranging the entries in the corresponding search result 
according to the sorting criteria property of the channel. 
44. A method according to claim 43, wherein the set of 
crawler properties further includes: a first set of constraints 
defining terms which must be included in an object in order 
for it to be indexed by the corresponding crawler; a second 
set of constraints defining terms which must not be included 
in an object in order for it to be indexed by the corresponding 
crawler; 

the step of generating index data corresponding to the 
retrieved data object further comprising: 
aaa) comparing the retrieved data object to the first set 
of constraints and to the second set of constraints in 
the crawler properties corresponding to the crawler; 
and 

bbb) if the retrieved data object complies with both of 
the first set of constraints and the second set of 
constraints, then generating index data correspond- 
ing to the retrieved data object, the index data 
including a name of the retrieved data object, a 
document type corresponding to the retrieved data 
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object, a date field corresponding to the last date on 
which the retrieved data object was modified, and a 
set of keywords associated with retrieved data 
object. 

45. A method according to claim 43, wherein the set of 5 
crawler properties further includes a set of authentication 
properties providing access to secure data objects in said set 
of structured and unstructured data objects by the corre- 
sponding crawler, the step of retrieving a data object corre- 
sponding to the URL further comprising: no 

ccc) transmitting a request to retrieve a secure data object 
corresponding to the URL; 

ddd) receiving a request for authentication data from a 
server corresponding to the URL; 

eee) transmitting a set of authentication properties corre- 
sponding to the crawler to the server corresponding to 
the URL; and 

fff) retrieving a secure data object from the server corre- 
sponding to the URL. 20 

46. A method according to claim 43, wherein the set of 
crawler properties further includes a schedule defining a 
time when the corresponding crawler is to begin execution, 
the step of executing said at least one crawler further 
comprising: 25 

executing said at least one crawler at a time defined by the 
schedule in the crawler properties. 
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47. A method according to claim 43, further comprising: 
receiving a request for access to the computer system by 

a user connected to the network interface through a 
computer network; 

providing a level of access to the user based upon infor- 
mation stored in an authentication server; 

receiving a request to execute a channel from the user; 

providing the corresponding search result to the user in a 
format readable by a browser program. 

48. A method according to claim 43, wherein the set of 
structured and unstructured data objects includes data 
objects that are outside the computer system, wherein the 
step of retrieving a data object further comprises: 

retrieving a data object from outside the computer system 
through the network interface. 

49. A method according to claim 48, wherein the set of 
structured and unstructured data objects includes data 
objects that are on the Internet, wherein the step of retrieving 
a data object further comprises: 

retrieving a data object from the Internet through the 
network interface. 
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